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RITF-DIRECTEn TRANSFORM ATION OF PLANTS 



The invention relates to methods for plant transformation. In particular for the precise 
integration of foreign DNA Into defined sites In the genome of a plant. The Invention also 
relates to transgenic plants obtained by such methods. 

In recent years, the development of genetic engineering techniques has had dramatic 
implications In the field of crop improvement. Using these techniques, beneficial traits can 
be Introduced into almost any crop and such improved crops can be rapidly obtained 
Instead of having to go through the lengthy process of introducing the desired trait by 
conventional breeding methods. 

However, present plant transfomiation methods generally lead to integration of the 
transgene at random locations in the host genome. This results in several problems, such 
as variability of transgene expression caused by different integration loci or so-called 
"positions effects", and the risi< of creating a mutation in the genome upon integration of the 
transgene into the host genome. Therefore, a large number of transformation events must 
be screened and tested in order to obtain a transgenic plant exhibiting the desired level of 
expression of the transgene without producing any abnormality resulting from the insertion 
of the transgene into an important locus in the host genome. Another disadvantage of 
random transgene integration is that If additional Uansgenes are to be subsequently added 
to a transgenic plant, integration of the additional trait at a genomic location different from 
the pre-existing transgene renders the breeding of both transgenes in elite lines 
cumbersome and difficult. 

One approach to obtain integration of a transgene at a predetermined location in the host 
genome is the use of homologous recombination. In this case, the incoming plasmid DNA 
molecule contains a stretch of DNA sequence identical to a particular genomic sequence 
and Integration of the incoming DNA occurs through reciprocal recognition of the 
homologous sequences by the nonnal recombination enzymes of the plant. Although 
homologous recombination is an efficient mechanism in yeast, fungi and certain types of 
mammalian cells, frequencies of homologous integration of a transgene in plants are 
extremely low compared to the background of random integrations, thus making the use of 
this approach for the genetic transformation of plants not generally feasible. 
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Ttoel»e, me« is an unl-ed need fo, imp^ed .ransfomaUon methods a,«a, fo, 
eae^eced lntegm«oo of a »ans,eno a. a p,ode,e™».ed genomic loca«o. into me hos. 



genome. 



The present invention relates to plant transformation and. because of the lack of 
predictability for the site of integration of a transgene In a plant genome using current 
transformation methods, the present invention addresses the need for precise mod.f.catons 
of plant genomes. In particular, the present invention provides methods for the Integration 
of transf om^ed DNA into a defined location In the host genome using a slte-specrf.c 
recombinase system and further provides transgenic plants obtained using the methods of 



the present invention. 

Site^pedfic recombinase systems typically comprise one or more prote.ns wh,ch mediate 
the specific Cleavage and ligation of two defined, asymmetric nucleotide sequences, the 
recombinase recognition sites. The recombinase recognition sites usually comprise two 
ir^verted repeats of about 13-bp separated by a spacer of about 8-bp. In some cases, the 
recognition site comprises three repeats, arranged as two in direct orientation, the 8-bp 
spacer and a tiilrd repeat In inverted orientation relative to the other two repeats. Because 
the recombinase mediates recognition and recombination between two recognition srtes .f 
two Identical recognition sites are present in a host genome in the continued presence of 
the corresponding recombinase activity, they can further recombine. thus creating an 
unstable chromosomal locus which is not desirable. It Is therefore an object of the present 
invention to use combinations of different recombinase recognition sites for slte-speafic 
ir^tegration of a transgene into the host genome, resulting in a stable chromosomal locus 
after recombinase-mediated integration of a transgene. 

using a method of the present invention, it Is tt^erefore possible to avoid the variability of 
transgene expression usually caused by -position effects" of transgene Insertion m 
transgenic plants or by different numbers of copies of the transgene Integrated into the host 
genome. Additionally, by using the method of the present invention, the poss,b.l.ty of 
mutagenesis due to random insertion of the transgene is avoided. Furthem^ore. the present 
methods also allow for additional transgenes to be inserted at the predetem,ined genome 
location instead of being scattered throughout the genome, thereby, e.g.. stacking 
transgenes at the pre^etemiined genome location. Furthermore, using a method of the 
present invention transgenes can be exchanged or no longer useful transgenes can be 
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The present invention therefore provides: 

A method tor site-specific modification of a plant genome comprising the steps of: a) 
obtaining a plant cell stably transformed with a first DNA sequence comprising two 
ells: recognition siteswhichdifferfromone another inthelr^^^ 
into said plant cellasecond DNA sequence comprising the two reoomb-nasere^^^^^^^^ 
s,tescorrespondingtotherecombinaserecogn.onsltesofthef.rstDNAsequ^^^^^^^^ 
presence o, active recombinase or reoombinases; c) identifying a recomb.nase-med.ated 
integration of the second DNA sequence at the chromosomal location of the f.rst DNA 
sequence. In a prefened embodiment, a nucleotide sequence is added at the pre- 
determined genome location. In another preferred embodiment, a n-.e°t.de -qu-^ at 
the pre-detem^ined genome location is removed and preferably replaced w.th another 
nucleotide sequence. ♦.^^^lot^^hip 
™ssa,,g,,BNAa»,ec>*,o,«ansisn«,exp,essedJ.e.s^dplan,ce>r.s.a*.ran*™^^ 
„» , DNA sequence composing the codmg region of at teast one reoomblnase exp-essfcle 
in said plant cell. 

The two recombinase recognition sites are desirably derived from two deferent srte-speaf . 
recombination systems. More desirably, the two recombinase recognition sites are denved 
from the same site-specific recombination system. Preferably, the site-spec.f.c 
recombination system is selected from a group consisting of R/flS. G.n/g«cand FLP/FRT. 
More preferably, the site-specific recombination system is FLP/FflT. Yet more pref e^b^ 
the wild-type nucleotide sequence of the spacer is 5 '-TCT aga aa-3' (SEQIDNO.l) 
and a modified spacer of a recognition site of the present invention is selected from the 
group consisting of 5. -TTC AAA AG-3' (SEQIDNO:2).5'-rrc aaa ta-S- (SEQ ID 

, , ,QPn in 5 ' -TCT AGA AG-3 ' (SEQ ID NO.S) and 5 ' - 

N0:3), 5'-TCT ACT TA-3'(SEQIDN0.4), b l<-i \ 
TCT AGA TA-3 ' (SEQ ID N0:6). 

,n a preferred embodiment, the first DNA sequence further comprises outside of the DNA 
stretch comprised between the two recombinase recognffion sites the coding reg.on of a 
selectable maker gene or a portion thereof, preferably preferably a 3' portion of the cod,ng 
region of the selectable mariner gene, wherein the 5' end of said coding region is adjacent 
to one of the recombinase recognrtion sites, wherein said selectable marker gene or porton 
thereof is potentially expressible in said plant cell if a promoter were provided, but not 
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expressed. In a further preferred embodiment, the 3' end of said coding region is operably 
linked to temiination signals. In a further preferred embodiment, an intron or portion thereof 
Is operably linked to the 5' end of the coding region of a selectable maker gene or portron 
thereof, wherein the 5' end of said intron of portion thereof is adjacent to one of the 
recombinase recognition sites. In a further prefen-ed embodiment, the firat DNA sequence 
further comprises between the two recombinase recognition sites an expression cassette 
comprising a gene of interest expressible in said plant cell. For example the first DfsJA 
sequence further comprises an expression cassette comprising a visible or a negative 
selectable marker gene expressible in said plant cell. In another preferred embodiment, the 
second DNA sequence further comprises between the two recombinase recognition sites a 
promoter capable of directing the expression of a gene in a plant cell, wherein said 
promoter is adjacent to one of the recognition sites and is oriented such that It is capable of 
directing transcription towards the outsWe of the DNA stretch comprised between the two 
recombinase recognition sites, wherein a functional fusion between said coding region of 
the first DNA sequence and said promoter is created upon recombinase-mediated 
integration of the second DNA sequence at the chromosomal location of the first DNA 
sequence. In a further preferred embodiment, the second DNA sequence further comprises 
between the two recombinase recognition sites a promoter capable of directing the 
expression of a gene in a plant cell operably linked to a portion of the coding sequence of a 
selectable marker gene, preferably a 5' portion of the coding region of the selectable marker 
gene, preferably a portion of the coding region of the selectable mart<er gene not present in 
the first DNA sequence, wherein said portion of the coding sequence of the gene of interest 
fused to said promoter is adjacent to one of the recognition sites and is oriented such that 
the promoter is capable of directing transcription towards the outside of the DNA stretch 
comprised between the two recombinase recognition sites, wherein a functional fusion 
between the portion of the coding region of the selectable mari<er gene of the first DNA 
sequence and the portion of the coding region of the selectable mariner gene of the second 
DNA sequence is created upon recombinase-mediated integration of the second DNA 
sequence at the chromosomal location of the first DNA sequence. In a further preferred 
embodiment, an intron or portton thereof is operably linked to the 3' end of the portion of the 
coding region of the selectable maker gene. 

In another further preferred embodiment, the first DNA sequence further comprises 
between the two recombinase recognition sites an expression cassette comprising a gene 
of interest expressible in said plant cell. In another further embodiment, the second DNA 
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s«,uence »e, oonposes b«w«r, the t«o recombinase «009*a sites an expression 
Zlo=™P*9a«..-*er,enoexp,es.*ins.ap,antce«and«enng^- 

«„ na*„ m the tl,^ DNA sequence, in anotner turthe, embodinnen., ti,e second DMA 
a«,uence fu*e, comprises between tl,e .»o recombinase reco,niUon si.^ one or r,«re 
additionai recmbinase recognMon site dflereni Iron, the omer two recombtnase 
^ni«on Sites in sa'^ DNA sequence and ad|acent to tne ^ end o. the cod^i, o. a 
selectabie marker gene, in a preferred embodiment. tem*.«on signals are fused to ».e 3 
end ol lire codins region of tlie seiedabie marker gene. 

|„ anoti^er preferred embodiment, tne first ONA sequence further comprises between the 
«,o recombinase recogni«on s«es an expression cassette compdsing a select^ marker 
gene expressible in said plant cell. In a further prefened embodiment, the first DNA 
equenoe funher comprises between the two recombinase recogniCon sHes en expre^ion 
cassette compnsing a gene of Interest expressible „ saM plant cell, and desirabh, fur«,er 
comprises an exp,essk», cassette compdshg a visible o, negate selectable marks, gene 

expressible in said plant cell. ^ . „„„ 

inafurther preferred embodiment, the second DNA sequence further c^mpnsesbe^ee^ 

the two recombinase recognition sites an expression cassette comprising a selectable 
marker gene expres^ble in said plant cell, m another further preferred embodiment, the 

second DNAsequence further comprises between the two recombinase recogn. on s«e^^^^^^ 
expression cassette comprising a gene of interest expressible in said plant cell. In ano^^er 
further preferred embodiment, the second DNA sequence further comprises between the 
two recombinase recognition sites an expression cassette comprising a visible mariner gene 
expressible in said plant cell, the second DNA sequence further comprises ^etwe- me 
recombinase recognition sites one or more additional recombinase recogn.t.on s,te drfferent 
from the two other recombinase recognition sites of said DNA sequence. 
The method optionally further comprises the step d) of regenerating from a plant ceil 
comprising a DNA sequence as discussed above to obtain a fertile plant. 

The invention further provides: 

A plant cell stably transfom^ed wrth a DNA sequence comprising at least two different 
recombinase recognitton sites. In preferred embodiment, the plant cell further comprises 
outside of two recombinase recognition sites the coding region of a selectable maker gene, 
Wherein the 5' end of said coding region is adjacent to one of the recombinase recogn.t,on 
sites therein said selectable maricer gene is potentially expressible but not expressed .n 
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«M p«n. ce., and p«.era«y ^ .Ke 3' «^ =. ^ coding region ,s °P»-* » 
«n.n«*ns^.lna.u-*eren*«<i.«.«.ep.an.ce«,u*ercompnses^^^^^^^ 

«, ^conbtase ^oognMon sHes », .xp««lon oa«e«e compasu,g a v,s*,o madce, gene 

expressible in said plant cell. ^„Ki«ac^ 
,n another preferred embodiment, the plant cell further comprises between two recomb.nase 
recognition sites an expression cassette comprising a selectable marker gene exp^^ble .n 
saw plant cell. In a further embodiment, the plant cell further comprises between the two 
recombinase recognition sites an expression cassette comprising a visible or negafve 
selectable mariter gene expressible In said plant cell. 

The invention further provides: 

A plant comprising plant cells as described above and regenerated from any one of the 
plant cells described above or a descendant of such a plant having stably integrated .n .ts 
genome a DNA sequence as described above. 

The invention further provides: 

A plant obtained by any one of the methods described above or a descendant of such a 
plant having stably Integrated In Its genome a DNA sequence as described above. 

Within the context of the present invention "site-specific" means at a particular DNA 
sequence, which is in a specific location in the genome. The nucleotide sequence can be 
an endogenous nucleotide sequence of the genome at its natural location in the genome or 
. it can be a heterologous nucleotide sequence which has been previously Inserted into the 
genome by any of a variety of known methods. 

TO -identify- a recombinase-mediated integration means that either a selectable mariner 
gene is used that selects for the site-specific integration or that a visible or negative 
selectable mariner gene is used, whose loss identifies a site-specific integration. 

-Expression cassette" as used herein means a DNA sequence capable of directing 
expression of a particular nucleotide sequence in an appropriate host cell, compnsing a 
promoter operably linked to the nucleotide sequence of Interest which is operably linked to 
tem,ination signals. It also typically comprises sequences required for proper translation of 
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the nudeotldesequence.The coding region usually codesforaprotein Of .^^^^^^ 
also code forafunctlonalRNA of interest, for example antisenseRNAoranontr^^^^^^^ 
RI^ that. In the sense or antisense direction, inhibits expression of a particular gene. e.g.. 
antisense RNA. The expression cassette comprising the nucleotide sequence of interest 
,„ay be Chimeric, meaning that at least one of its cornponents is heterologous with respect 

,0 at least one of its other components. The expression cassette may also be one wh.ch « 
naturally occurring but has been obtained in a recombinant fom, useful for heterologous 
expression. Typically, however, the expression cassette is heterologous with respect to e 
host i e . the particular DNA sequence of the expression cassette does not occur naturally 
in the host cell and must have been introduced into the host cell or an ancestor of the host 
cell by a transfomiation event. The expression of the nucleotide sequence in the express.on 
cassette may be under the control of a constitutive promoter or of an inducible promoter 
which initiates transcription only when the host cell is exposed to some particular external 
stimulus, in the case of a multicellular organism, such as a plant, the promoter can also be 
specific to a particular tissue or organ or stage of development, 
•heterologous- as used herein means "of different natural or of synthetic origin". ..e. rt 
represents a non-natural state. For example, if a host cell is transfomied with a gene 
derived from another organism, particularly from another species, that gene is heterologous 
with respect to that host cell and also with respect to descendants of the host cell wh.ch 
cany that gene. Similarly, heterologous refers to a nucleotide sequence derived from and 
inserted into the same natural, original cell type, but which is present in a non-natural state, 
e g a different copy number, or under the control of different regulatory elements. 
-Regulatory elements" refer to sequences involved in conferring the expression of a 
nucleotide sequence. Regulatory elements comprise a promoter operably linked to the 
nucleotide sequence of interest and termination signals. They also typically encompass 
sequences required for proper translation of the nucleotide sequence. 

A "selectable maricer gene" refers to a gene whose expression In a plant cell gives the cell a 
selective advantage. The selective advantage possessed by the cells transfomied with the 
selectable mariner gene may be due to their ability to grow in presence of a negatwe 
selective agent, such as an antibtotic or a heri^idde. compared to the ability to grow of non- 
transfom^ed cells. The selective advantage possessed by the transfomied cells may also be 
due to their enhanced capacity, relative to non-transformed cells, to utilize an added 
compound as a nutrient, growth factor or energy source. A selective advantage possessed 
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caled twgatiw sotectioa-. In this, a compou-d b added (hat is to»o on y 
t;laspec^»»-(a™9a«-se^*n-<»'«-'^''''"""'^^'""'"" 
(typically a transgene). 

,«„e™d<e,,sas-..s.s,oa,s.e»hossexp™ss,o.doesnotcc„.a,a„advan.a,eto 
a transformed call but can be made detectable 0. visible. 

...„eo,,me.s.e,e.toan..ne..^^^ 

Ta— :rr:rrrpr:..-"-- 

IHm la perton^noe ». an industh^ P-ss o, aneted «p,oduc«ve cap^^^^^^ 
d.eJma,a^beone««t.t«ns,anad.op.an..or,hep^uCono, 

oommemially valuable enzymes or metabolites m the plant 

A -olanf «le« to any plan, o, pan o. a plan. a. any stage of development. Therein are also 
J^e^ X J or ^ue culfures and seed. As used In con,unCon ^ thep^sent 
1^1 ^ 1 -plantsssue- Includes, bu, Is ^, limited to, whole p,an«, plant «K 

organized into structural and/or functional units. 

Aplanfregenerated-.,omaplantoellmeans.ha.^ceteo.thep.antamda,.red.rom.ha. 
plant cell. 



I. Site-specific Recombinase Recognition Systems 

The present invention relates to rnethodsfortargeting the insertion ofatransgene,^^^ 
predetennined chromosomal location using site-directed recombinase systems. A s.te- 

s el recombinase system ^^^^^^^^ 
::r;specificnuLidese.uence.andc.eaveand|-.^^^ 

,le. site-specitic exchange Of genetic information, several site-spec.f. ^ 

are known in the art. These include, but are not limited to. e.g.. the bactenophage PI 
areknowninine , ,,q«.,p«„ 05. 729.736) the R/RS recombinase system from 

Cre/tox system (Austin et al. (1981) Cell 25. 729 /do,. me 
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an 8-C»p spacer, swii inverted orientation 

„p,a.,nd,«c,o.e„,a*n,me8-bp.pace,a„ a™«. ^^^^ 

rr.r.rrr::-.i---»- 

„pea.o«a„.a«onresu>«ln.he,nve«tond«s9aneBc™,e„anoca,aabe^^^^^^^ 

on two different molecules is possibie and since some 

..a^.a.nc..P.e.^— ^^^^^ 
potentially reptesent a powerful tool tor precise, sue st» 

II. Use olDfflerent Becomblnase Recognition Sites 

„ «.o idanscal recognition sites are present In the host genome after a '^"^""^ 
m,ec<^rleventanai,anaot.erecor,*inaselsstll,avaiUb.e,n.nenc^^s, 

rrreco,n«.nsltesare.keWtecon«nuetereoon.b,ne..bereby.ea.ng— 

lsFuJnnore,*Ke,n,,amo,ecula,exd.on,sKinet,ca»,.avoreaoverb™iec^^^^^^ 

r^on,A^r.e...,.S,S,P.n,0.r:e4..5S,.properin.egra«oneventscanbelos. 

before the plant cells become free of recombinase. 
.e..e,,teavo«,.er^nattne«<.»le.*o— -^^^^^^^^^ 

recombinase only transiently (see e.g. FuKushige and Sauer (1 992) PNA5 
TdueteUreunlavored tone., otwegration vs. exdsion.tbls strategy has on^rW^ 

^ZZ., strategy cons^ts o, crea«ng mutant re»g^n .tes ^ 
Clruon vs. ex^ion, such as « sUes containing mutated InveKed repeal ,A.er. e. 
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, ,1^, Plan. J 7- 649^59). In *ls case, to mMOo.s were <o to inv««d 

,n a further preferred embodiment, the two recognition sites are denved from the same 
in a turtner p ^.^^^^ ^^^.^ ^p^^^ example, 

recombination system and, in this case, me wo 
«,en used, a, leas, one o, to space, o, to s,es ,s 

used a. leas, one o, to spaoe. o. to .,x s»es is tod»,ed, *en F/HS . used, a, ^a^ 
useu, anua CI o/iTDT id iQpd at east one of the 

one 0. to spacers o, to flS s«es Is n^*ed, w.,en ^"'^^^"^■''^ ^rpiFRT^ 
^cecs 0, to FRTsUasIs n«d«ied. ,n a,.*e,p,e,ened en*«.»nen, to ^/WT^ 
* * ,eco.*ase s,s.en, Is used and to «o «co„«nase - ^« , 

1. ,n .nis case, to »»d-we nuc,e„«e sequence o. to space, . S -TCT 3 
,SEO » NO , ). in a prelened embodlmen,. a desUed m«*d recogn*on s«e shou« n« 
r«rpo,, y Jlne and p<.,pu,to »ac« *n ex^nd ,^ to ^P-*^-;-^" 
::::jd:peaUana™duc,^o.to.*ATc.n»n.snou««a^^^^^^^^^ 

.o^f,ed,ecognl«ons«eo.top,esen,^en.^^S.-T^ «x 

Otor p,e.e.edmod.ied«cognl«cn8»esa«5-rrc «A = 

L L ,SEQ,DN0-.4,,S.-TCT .=-3. (SEQ.DNO-.5)orS.-TCT 

TA-3.(SEQIDNO:6).lnafc*e,p«.«redemboditon..tofVRSs«e^c*c 

Icomlse sys^n, . used and to «,o recognl*. s«e, ace RS s.es. In a 
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TT-3 ' , preferably 5 ' -cgt-3 ' . 

HI. Evaluation of the Recombinase Recognition Sites in Plants 

,t is known in the art that plants are deferent from other Wngdoms in many aspec^an^ 

resuitsobtainedindifferentkingdomscannotpredidablybeext,Bpolatedto 

Therefore, the inventors of the present invention first tested the element of the slte-specrf.c 

recombinase system in plants, in particular the recombinase recognition sites. 

The FLP recombinase responsive reporter gene constructs are assayed in the presence 

and absence of FLP recombinase. A stuffer fragment surrounded by two recombinase 

recognition sites is inserted between the coding sequence of a luciferase gene and a 

promoter. The stuffer fragment flanked by FflT sites is excised from the untranslated 

leader of the luciferase gene only in the presence of FLP recombinase. resulting ,n 

uanscripfion of the luciferase gene and luciferase acth/ity. For the wild-type FRTsites the 

luciferase acUvrty in presence of active recombinase is 73-foid over the luciferase actv.^ -n 

the absence of the recombinase. The modified P and P sites are also recogn.zed by FLP 

recombinase. and in the presence of the recombinase the stuffer fragments flanked by P 

and P sites are also excised from the untranslated leader of the luciferase gene, resultmg 

in luciferase activity of 18-fold and 38.fold respectively over that of pAT206 and pAT207 .n 

the absence of recombinase (compare example 3). 

The cross-reactivity between wlld-type and modified FflTsites is also tested. Luaferase 
activity m the presence of FLP recombinase Is tested on constmcts containing stuffer 
fragments flanked by wlld-type and P sites, wild-type and P sites and P and F sites. 
(nAT210 21 1 and 213). Luciferase activity of these constructs in the presence of 
recombinaseisonlytwo-foldoverthatofthepAT163vector(stufferflankedbywild-W^^ 

sites) in the absence of recombinase. demonstrating that combinations of the wild-type and 
modified FflT sites are not cross-reactive (compare example 3). 
For the first time, the inventors of the present invention show that these elements are 
functional and interact specifically in plants. 
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IV. Recipient Lines and Donor Constmcts 

The present invention provides a method to cany cut site-specific integration of a transgene 
using two different recombination recognition sites and comprising the steps of: a) obtaming 
a plant ceil (recipient cell) stably transfomied with a first DNA sequence (recipient construct) 
comprising two recombinase recognition sites which differ from one another in their spacer; 
b) introducing into said plant cell a second DNA sequence (donor constmct) comprismg the 
two recombinase recognition sites corresponding to the recombinase recognition sites of 
the first DNA sequence, in the presence of active recombinase: c) selecting and screening 
for a recombinase-mediated integration of the second DNA sequence at the chromosomal 
location of the first DNA sequence. 

A recipient cell of the present invention is obtained by a transfomiation method as 
described below or by methods otherwise known in the art. Such transgenic line preferably 
contains a single copy of the recipient constmct integrated in its genome. Once such a line 
has been identified, it is further characterized. For example, the location of the transgene 
insertion is precisely detemiined by genetic methods well known in the art. Additionally, the 
host plant DNA flanking the site of insertion is sequenced and it is ascertained that no 
essential gene is mutated by the insertion of the transgene. Once a well characterized line 
is obtained, it is used as a recipient for subsequent introduction of additional transgenes. 
Such additional transgenes are comprised in the donor construct and are introduced into 
the recipient line by transfomiation methods as described below or othenwise known in the 



art 



In a preferred embodiment, the two recombinase recognition sites are in direct 
orientation in the DNA sequence used to produce the recipient line. In a further preferred 
embodiment, the coding region of a selectable marker gene and preferably temiination 
signals fused at the 3' end of the coding region are comprised in the DNA sequence 
precisely situated outside of one of the recombinase target sites flanking the DNA stretch 
located between the two recombinase recognition sites (Fig. 2). The 5' end of the coding 
region is adjacent to one of the recombinase recognition sites. In a pref enred embodiment, 
the recombinase recognition site is inserted in an intron located in the untranslated leader at 
the 5' end of the coding region. In another prefen-ed embodiment, the recombinase 
recognition site is inserted in the 5' untranslated region of the selectable mari<er gene. In 
another preferred embodiment, a portfon of the coding region of the selectable mari<er 
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gene, preferably a 3' portion. Is used, wherein the 5' end of the portion of the coding region 
of the selectable marker gene is adjacent to one of the recombinase recognition site. In a 
further preferred embodiment, an intron or portion thereof is operably linked to the 5' end of 
the coding region of the selectable maker gene or to a portion thereof, wherein the 5' end of 
said intron or portion thereof is adjacent to one of the recombinase recognition sites. 
In such construct, no transcription of the selectable marker gene is possible because of the 
absence of a promoter directing the expression of the selectable marker gene. Such 
selectable marker gene can be any selectable marker gene described below or othenArise 
known in the art. In a preferred embodiment, such selectable marker gene is the bargene. 
In a further preferred embodiment, the DNA sequence also comprises an expression 
cassette which is included between the two recombinase recognition sites in the DNA 
sequence. The expression cassette preferably comprises a selectable marker gene, such 
as any selectable marker gene described below or othenwise known in the art. In a 
preferred embodiment, the selectable mari<er gene is an nptll gene, conf ening resistance to 
kanamycin and a preferred promoter In the expression cassette is the nos promoter. The 
DNA sequence desirably also further comprises another expression cassette which 
comprises a gene of Interest and desirably also further comprises a visible marker gene, 
such as. e.g.. a GUS gene, a luciferase gene or a GFP gene, or another selectable marker 
gene. The DNA sequence preferably also further comprises an expression cassette 
comprising a negative selectable mariner gene, such as a cytosine deaminase gene (Perera 
et al. (1993) PMB 23: 793-799). a Herpes Simplex Virus Thymidine Kinase gene {Czako 
and Marton (1994) Plant Physiol. 104: 1067-1071). a T-DNA gene 2 (Depicker et al. (1988) 

Plant Cell Reports 7: 63-66). 

The donor constmct used to transform the recipient line described above comprises 
the corresponding two recombinase recognition sites. In a preferred embodiment, the two 
recombinase recognition sites in such donor construct are in direct repeat orientation. In a 
further preferred embodiment, the donor constmct comprises a promoter located between 
the two recombinase recognition sites, wherein the promoter is adjacent to one of the 
recognition sites and is capable of directing transcription towards outside of the DNA stretch 
comprised between the two recombinase recognition sites. In a further preferred 
embodiment, the promoter is operably linked to a portion of the coding sequence of a 
selectable marker gene, preferably a 5' portion of the coding region of the selectable marker 
gene, preferably a portion of the coding region of the selectable maricer gene not present in 
the recipient construct. In a further preferred embodiment, an intron or portion thereof is 
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J ,M such P^rnote, is .,e u«,* 3 (uaQ3) pnmote, (Noms o, a,. (199 PMB 
^:Ls^6,\e<k>no,cons«uo.a,soocn,p.sesa.expres*„casse«eco.p^^^^ 

selectable mator gene is any seleclable mart<er gene descnbeci beiow or emerge 
:r^^llsdln,,™n,ase,e.ab,e.a*ergene,».p.sed,n.ne.y.«-ne 

Xden*odiman,.su*se,ec«bie.a,Ke,,eneis,neny,.n.,c,n,es»«n«^^^ 
CL. consin-C aieo dasi.b,v co.pnse. beiween *e «con*,na« re»^.» 

0 rll.— se.co^»».sne,»b^.*«-n,,™,e»o^.^ 
Zbinase s«ee end »h«, . e*>cen. .0 ..e . <f ^^^^ ^ 

sue end «,e co*9 «gK»> o, ^e ee^^bie .ad.e, gene - - ~" 
the « wo recogniton sites. Prale^biy, me .hl-d recombinase ,ecogn«n s,te s 
r^ded bytne ^» exp^ssion cassettes ™n,.ned .up.. A«e, a ^undo, 

,econ*U,ase.n,e*a,ed i^,««on, *e m„d .econ*,nase recognWon site a o»s^or 
„con*»«ee.ne«a,edU,tea*no„unhe,add,«onaitmnsgenesu.ng «i tn^ 

^ccb^ .cogniUon site and either one o, the ,»o -3™- " 
constn^t also further co,npr,sesbe,»eentna«o™combinase,eoogn,t™ s« 2^^^ 

ad*ona, expression cassette con,pnsing an, gene o. interest wh.=h ,e -» « 

.ne genome o. the re*en, iine. in such exp.s.on cassette, the expreseten o. the gene 

interest is controlled by any one of the promoters described below. 

open reoon^blnase-med^ed integratton o. «,e donor construe, *e n.^* 
seouence comprised between the t»o recombinase recognl«on sites In the rec^^n. i™ .s 
Z eZl n..eotlde seguence o, ^e d».r construe. wh«r is ^ted between .he 
rrc^gln sues In the donor oonstnrct. Tbe de.,^ ^««»n even, is re».e, b 
Acting lor resistance .o the seiec«bie marker gene. Wl«n .be cor^g sequence o< ^ 
marker gene . lo«.ed ou^ o, the ONA stret* oon^r^d be««en «,e two 
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Jupon ^comblnase^edlated WegraUon of .he dono, coosBu«. Wl»n »> <^ 
orTrrmereo, a« tnCudM ,n ^ dono, »nd ,e*o« oohsm* . .-ctona, 
sL^La^o, ,..0 ooo^Ks^g a sp,lca.,a ...n . c«a.d upon «o„*nasa. 

mediated integration of the donor construct. 

rI.o„ ,0 Loc«on ,0, reslsunce .0 «.a se,sc««e ™*or ge„a located o*^ o, .^^^^ 
ON 1,* co.p^ed .e.»ee„ «,e .wo ,e„n,«.ase reoo^Mon ^ ^ '^-^ 

0, as a„ a,.e-vs. *-sped«o ,eoo.b^ase-™,ed^.ed .n,e,.«on ,s -"-"^-^ "» ^ 
o.«,ene9a««se,ec.ab,ema*ergeneU>ca«dbemeen«,e«oo,T*«se«cogn«o„».e, 

in the recipient line. 

1„ano*erer*»di™>en.,aredpl.n.llnettana.o™edw«hadi«.ren.DNAse.^encefe 
:2d.P..«a.>.,.e.od«,e^.recon*-re»gn*on^esa..*^^ 

me DNA sequence used « p«.duce .he «dp.en. .Ine. In a .urther embod.meh,^such DNA 
sequence a^ con^ - e W oasse«e wh,ch . .nCuded ^^-"^^ ^ 
JomWhase ^cognWon sites .n fte DNA sequence. The expression casse«e compnses 
H.e « Jgene. such as any se,ecuh,e .a.er gene descnhed 
Known .h *e art. .n a p«.ened embodmeht. «,e seleoBble marker gene rs a kanamyc n 
^nce gene and a preferred pro.o.er ,n .he exp.s*n casseUe . me nospron»te. 
Tne DNA sequence desi^bW a.o cor^rises ano.he, expression cassene be^njhe «o 

recomblnase «co,nl,lon sHes, ^ comprises a gene o. Imeres., and a.^ des^b^ 

..nher compHses a visible mad<er gene, such as, e.g., a GUS gene, a "J™^ 
GFP gene, or ano.h,rse.e«ablema*er gene. The DNA serenes p,e.e«bWateo.ud^ 

comprises an expression casse..e compr^lng a negate selec^Me mad»r gen. ^e DNA 
sequence Is .ranslonmed l„,o a suable hos, plan. ce. and a se,eo«b.e mad^r o. «» DNA 
sequence is used .0 selec. nanlomiatton evente. 

A donor cons.mc. used .0 Uanslom, a redplen. .Ine as descdbed imn«dia.eV above 
comprises me corresponding «ro dWeren. recomblnase racognlUon snes. In a pralerred 
embodlmen., me ^ recomblnase ,eoogn.«on s«es m such donor cons»«. are ,n drreC 
odenraaon. The donor consm,:. ateo comprises an expression casse..e «.mpns,ng a 
setecUbte martce, gene -mse-ted be*«een me Wo recomblnase recognwn snes. Such 
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setectable mariner genes is any selectable marker gene described above or other^^^^ 
known in the art. but is different from the selectable marker gene comprised in the recpient 
una The donor construct also desirably comprises between the two recombinase 
recognition sites an expression cassette comprising a visible marker gene, such as. e.g.. a 
GUS gene, a luciferase gene or a GFP gene. Additionally, the donor construct also 
desirably comprises a third recombinase recognition site, which is different from the other 
two recombinase recognition sites, between the first two recognition sites. Preferably, the 
third recombinase recognition site is surrounded by the two expression cassettes mentioned 
supra After a first round of recombinase-mediated integration, the third recombinase 
recognition site allows for recombinase-mediated integration of further addrttonal 
transgenes using the third recombinase recognitton site and any one of the first two 
recognrtion sites. The donor constmct also further comprises between the first two 
recombinase recognition sites an additional expression cassette comprfeing any gene of 
interest which is to be Integrated into the genome of the recipient line. In such an 
expression cassette, the expression of the gene of interest is controlled by any one of the 
promoters described below. Such gene of interest Is an insect resistance gene, a d.sease 
resistance gene, a hertDiclde resistance gene or a gene encoding a value-added tra.t. 
upon recombinase-mediated integration of the donor construct, the nucleotide sequence 
located between the two recombinase recognition sites in the recipient line is replaced by 
the nucleotide sequence of the donor construct which is located between the two 
recognition sites in the donor construct. The occurence of the desired integration evem .s 
monitored by resistance of a host cell to a selectable mari<er gene of the donor construct or 
lack of resistance to a selectable maricer gene or lack of expression of a visible maricer gene 
previously integrated in the recipient line, and which was lost due to the site-spechc 
integration event. In addition to selection for resistance to the selectable maricer gene 
located between the two recombinase recognition sites in the donor constmct. or as an 
alternative, site-specific recombinase-mediated integration is monitored by the toss of the 
negative selectable marker gene which is located between the recombinase recogn^on 
sites in the recipient line. 

V. Supply of Active Recombinase 

An active recombinase is provided to cells of the recipient line to catalyze the site-specific 
recombination. In a preferred embodiment, a DNA sequence comprising the coding 
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Ts 0. do.0, const^C V. reoo^ln- .re u«.. De*a«,. . 1 :1 
"5V10or,:.00„»la,,a.odo«,ro<»,s»u.«vs.«con«nase«also*s,«bl,u»d- 
1 .5, 1 .lu or 1 . lu ^^^^^ construct vs. 

more active recombinase is necessary. A 5.1 . 10.1 or iQU.i mo 
recombinaseisalsodesirablyusedifmoredonorconstmctisnecessanj. 

another preferred embodiment, the recipient line is stabty transformed w t a DNA 

easseae c»,np*ln, «» oodins sequence o, a .ecombinaee ,s uaed. In a prele^^ 
rl»n, L J. an exp™ss,on casseUe ,ne co*= --ce o. a^,cc^».- 
ftoked by pa-alel matching sites, lor example parallel malchm, FBT s«es. Up 
Zl"o'-'ec^-ses,te.speci,.,nte.ra«ono,a,eneo,l„.eres^a^v^a^ 

elon of the reoomblna^ expression catena trom .he hcs, genome . alao ohU,n«i. 
l„ano«,erpre.e,reden,bo*cn,,.herecon*»,ase,spro*adUans,e„«»U>mapla^^^^^^ 
: a prelerr^ emboaiment, the co*, se,.e„« o. m, -aeon*.- ^ ^ZTS 

In, seZence 0, the reconr^ase ^ oo.hp*"' I- ^^P-''"" -=="7* *X 
:^:ss.L,.he^.«nasa.ce.lso,^rec*.n.«ne,VW,ep»ccnsa«~^ 

.led Viruses, e,..rom,e,r«^s.st«.as,ro..heMa.ea.reaK™^^^^^^^^^ 
Hohn (.995) J sen V«™i 76.965^9, and pi^d si^atanualiy as desa,b^ n - pe^^^^^^ 
,pp«c^«on08^17.e76,hon.*eWheatdwaH,^(MaUa«et^ai99.)Plan.C^^^^^^^^ 
2^, a tobacco gemini vinrs such as. a.g, me tobacco go«en mosa. vinis. or from 
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other viruses. In a preferred emlK)din,ent. the viral replicon comprising the recombinase 
coding region is Introduced into cells of the recipient line by Agroinfection (Grimsley et ai. 
(1989) Mol Gen Genet 217:309-316). 

In a further preferred embodiment, the coding sequence of the recombinase is supplied to 
the recipient cells as a messenger RNA (mRNA). The advantage of such technique .s that 
the recombinase is only provided to the host cells transiently. The coding sequence of the 
recombinase is inserted in a vector for in-vitra transcription of the RNA using methods 
described in Lebel et al. (1995) Theor. Appl. Genet. 91: 899-906 or In co-pending 
application 08/717.676. The RNA is then transfomied into cells of the recipient line, 
preferably by co-transfomiation v«th the donor constoict In a preferred embodiment, the 
RNA is transferred to the host cells using particle bombardment as described in co-pend.ng 
application 08/717.676. In another preferred embodiment, the RNA is introduced into the 
protoplasts or other cells of the recipient line by PEG-mediated transfomiation as descnbed 
in Lebel et al. (1995) Theor. Appl. Genet. 91: 899-906 or by electropotation. In another 
preferred embodiment, other techniques, such as microinjection of the RNA, are used. 
In another preferred embodiment, the active recombinase is supplied to the cells as a 
purifed protein which is introduced into the cells of the recipient line, for example by 
microinjection, preferably together with the donor construct (see e.g. Neuhaus et al. (1993) 
Cell 73:937-952 ). 

A method of the present invention also provides for improved transformation efficiency due 
to more efficient interactions between the recombinase recognition sites in presence of 
active recombinase. For example, improved transfomiation efficiency is achieved for 
meristem transformation, thereby avoiding the need to go through tissue culture in some 
crops. 

VI. Selectable Marker Genes 

Several selectable maricer genes are available for the present invention but are not limited 
to those described below. For certain target species, different antibiotic or herbicide 
selection mariners may be preferred. Selection mari<ers used routinely in transformation 
include the nptll gene which confers resistance to kanamycin and related antibiotics 
(Messing & VIerra. Gene 19: 259-268 (1982); Bevan ef a/.. Nature 304:184-187 (1983)). the 
tergene which confers resistance to the herbicide phosphinothricin (White et al., Nucl 
Acids Res 18: 1062 (1990). Spencer ef a/. Theor Appl Genet 79: 625-631(1990)). the hph 
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gene which confers resistance to the antibiotic hygromycin (Biochinger & Diggelmann. Mol 
Cell Biol 4: 2929-2931). and the dWrgene. which confers resistance to methotrexate 
(Bourouis era/.. EMBO J. 2: 1099-1 104 (1983). Selection markers resulting in positive 
selection, such as a phosphomannose isomerase gene, and described in patent application 
08/527,474, are also used. 

VII. Plant Transformation 

Plants transf omied in accordance with the present invention may be monocots or dicots and 
include, but are not limited to. maize, wheat, barley, rye. sweet potato, bean. pea. chicory, 
lettuce, cabbage, cauliflower, broccoli, turnip, radish, spinach, asparagus, onion, garlic, 
pepper, celery, squash, pumpkin, hemp, zucchini, apple, pear, quince, meton.plum. cherry, 
peach, nectarine, apricot, strawberry, grape, raspberry, blackberry, pineapple, avocado, 
papaya, mango, banana, soybean, tomato, sorghum, sugarcane, sugarbeet, sunflower, 
rapeseed. clover, tobacco, carrot, cotton, alfalfa, rice, potato, eggplant, cucumber. 
Arabidopsis thallana, and woody plants such as coniferous and deciduous trees, especially 
maize, wheat, or sugartjeet. 

Once a desired DNA sequence has been transformed into a particular plant species, it may 
be propagated in that species or moved into other varieties of the same species, particulariy 
including commercial varieties, using traditional breeding techniques. 
For their expression in transgenic plants, a DNA sequence may require modification and 
optimization. It is known in the art that all organisms have specific preferences for codon 
usage, and the codons in the nucleotide sequence comprised in the DNA molecules of the 
present invention can be changed to confomi with specific plant preferences, while 
maintaining the amino acids encoded thereby. Furthemiore. high expression in plants is 
best achieved from coding sequences which have at least 35% GC content, and preferably 
more than 45%. Nucleotide sequences which have low GC contents may express pooriy 
due to the existence of ATTTA motifs which may destabilize messages, and AATAAA motifs 
which may cause inappropriate polyadenylation. Although prefen-ed gene sequences may 
be adequately expressed in both monocotyiedonous and dicotyledonous plant species, 
sequences can be modified to account for the specific codon preferences and GC content 
preferences of monocotyledons or drcotyledons as these preferences have been shown to 
differ (Murray etal. Nucl. Acids Res. 17: 477-498 (1989)). In addition, the nucleotide 
sequences are screened for the existence of illegitimate splice sites which cause message 
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«,„ca«on.«lch,n,.s«,u«.ob,™de».h,n me nudeo«de sequences such as 
7eLed above a« nade using weU .no« .echn„ues o. s«e directed mu^^ne. . PCR, 
and aynme* gene cons»uc«on using «.e memods descnbed in «,e pubi^hed pa»n, 
.pp,lca«ons6P0 385962,EP0 359 472.andWO93»7278.l.necessa„,««se 

nLca^ons are pedcn^d on a „udeo,«e seouenco o, me prosen, Invenbon .n pa,-^ 
me coding region o. a recomb»«sa, of a ma*er gene or o. an, ge,» o. .teres, to be used 

in a plant of the present invention. 

For efficient initiation of translation, s^^^^^ 

require modification. For example, they can be modified by the inclusion of sequences 
known to be effect^e in plants. JoshI has suggested an appropriate consensus for plants 
(NAR 15: 6643-6653 (1987)) and Clontech suggests a further consensus translation 
nitiato7(1993/1994 catalog, page 210). These consensuses are su-.^^^^ 
nucJdesequencesofmislnvention. The sequences are incorporated into con— 

comprising the nucleotide sequence, up to and Including the ATG (whilst leaving the second 
amino acid unmodified), or alternatively up to and including the GTC subsequent to the 
ATG (with the possibility of modifying the second amino acid of the transgene). If 
necessary, these modifications are performed on a nucleotide sequence of the present 
invention, in particular the coding region of a recombinase. of a mariner gene or of any gene 
of interest to be used in a plant of the present invention. 

The DNA molecules in transgenic plants are driven by a promoter shown to be functional in 
plants. The choice of promoter will vary depending on the temporal and spatial 
requirements for expression, and also depending on the target species. Although many 
promoters from dicotyledons have been shown to be operational In monocotyledons and 
vice versa, ideally dicotyledonous promoters are selected for expression in dicotyledons, 
and monocotyledonous promoters for expression In monocotyledons. However, there is no 
restriction to the provenance of selected promoters; It is sufficient that they are operational 
in driving the expression of the DNA molecules In tir^e desired cell. 
Preferred promoters which are expressed constitutively Include promoters derived from 
Agrobacterium opine synthase genes, e.g. the nos promoter, or a dual promoter from the 
Agrobacterium Tl plasmid (Velten et al. (1 984) EMBO J. 3: 2723-2730). or viral promoters 
operable in plants, e.g. the CaMV 35S and 19S promoters, and promoters from genes 
encoding actin or ubiqultln. Another pref ered promoter Is a synthetic promoter, such as the 
Gehrin super MAS p« (Ni et al. (1995) Plant J. 7: 661-676). The DNA molecules of 
this invention can also be expressed under the regulation of pK,moters which are chemically 
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^«ed. Prelened »*noW .« *e,*a, WuCto- o. gene exp^sston Js *.*d in .he 
chemica! Induction is the tobacco PR-1 a promoter. 

tav. been deecnbed v*.ch are expressed a. »ound s»es and also a, *e 8«, o. 
pn^pamo9e„,n.ec*n.Pre.er.dp,omo.e.of,his«ndi^ude.hose ese*«»by 

ZL el MOI. sen. Gene., aoo-208 ,.989,. Xu e,a,. P.amMe.ec B^^^^^ 

^M,993,.Loge.an„e,a,.P.an,Ce.l:.6...58,.989,,Roh„^er.&U«ena„.Mo,e^. 

"""^ ^ , „. .M«i«^ Rini 92- 129-142 (1993), and Warner ef 

Biol. 22: 783-792 (1993). Firek etal. Plant Molec. Biol. 2g. 1Z9 14^ (i» i. 

a/ Plant J. 3: 191-201 (1993). 

P„,ened «'ss.e spec»lc expression patterns ^*de green «esue spe*, .00. spec*o 
.:rspe..ic. end ..wer spec,... Pron»«rss*ab,e,orexpress^n,n green .ssue,^^ 

,nany«hicn,egu.a,e genes ,nvo,ved.npho,os,n*e.»andn«nyc.«»sehaveb^^^^ 
coned .mm bott, monocoWedons and *»„Wons. A prefened p,om«er » ^ r«« 
PEPC pron».er .ron, the phosphoenC carboxytee gene <Hudspe«. & Gnha. P.an Mc-ec. 
Bio. 12:579.589(1989)). A prefened promoter .or ,00. specHte expression ,s».a. 
^ ..e Framond FEBS 103-106 ,1991): EP 0 452 269) and a ,u,mer 
p,e.enedroo..speo«icpn,m«e,te.ha.*om«.oT-1 gene pn=*.ed by m,s ,nven«on^ A 
^la Stem spe* p^moter . .ha, des.l.ed ,n US pa.en, 5,825.1 36 end whrch dnves 

expresston 0. tt,e maize «p>. gene, now kno« as matee Indote syn.hase ,see Frey e. a,. 

(1997)Solence277:696-:Melaneone.al.(.997)PNAS94;13345). 

P,^eLen*o*menteo.tt,elrw«.«ona,e.ransgen,cp.an,sexpressingaDNAmo,ecute 

.„ a ,oo,.spec«ic .ashion. Fur.he, preferred embodimenrs are .mnsge^o p,an.s expressrng 

«,e DNA sequence in a «ound*ducible or pa«.cgen ln.eo.ion*ducibte manner 

^ «mo. .0 .he sele^oo o. a su«ab,e p^mCer, cons«uc«ons .or expression of *e codrng 

«ions o. .he presen. inven«on in plante preferably require an appropriate »anscr^«on 

tem,ina.or ,0 be a..ached do»s.ream o, .he heteroK.gous nuoteo«de aequ=«ce. 

such tem.lna.ors are available and kno«n In «,e en (e.g. .m, from CaMV. E9 frorn *cS). 

Any available .emrinafor Icnown .0 funCion in plante can be used in m con.ex. o. m 

invention. 

Numerous o»,er sequences can be incorporated ln.0 expression cassettes forUte 
expression of a coding regton of tt* in»en«oa TTtese mdude sequences wh»* have been 
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Shown to enhance expression such as intron sequences (e.g. from Adhi and bronzel) and 
viral leader sequences {e.g. from TMV. MCMV and AMV). 
It may be preferable to target expression of a DNA sequence to different cellular 
localizations in the plant. In some cases, localization in the cytosol may be desirable, 
whereas in other cases, localization in some subcellular organelle may be prefered. 
Subcellular localization of transgene encoded enzymes can be undertaken using 
techniques well known in the art. Typically, the DNA encoding the target peptide from a 
known organelle-targeted gene product is manipulated and fused upstream of the 
nucleotide sequence. Many such target sequences are known for the chloroplast and their 
functioning in heterologous constructions has been shown. 

Vectors suitable for plant transfomiation are described elsewhere in this specification. For 
Agrobacteriu/THTiediated transfomiation. binary vectors or vectors carrying at least one T- 
DNA border sequence are suitable, whereas for direct gene transfer any vector is suitable 
and linear DNA containing only the constnictton of interest may be preferred. In the case of 
direct gene transfer, transformation with a single DNA species or co-transfom^ation can be 
used (Schocher etal. Biotechnology 4: 1093-1096 (1986)). 
The recombinant DNA sequences can be introduced Into the plant cell in a number of 
art-recognized ways. Those skilled in the art will appreciate that the choice of method might 
depend on the type of plant. i.e. monocot or dicot. targeted for transformation. Suitable 
methods of transforming plant cells Include microinjection (Crossway et al., BioTechniques 
4:320-334 (1986)). electropora«on (Riggs et al, Proc. Natl. Acad. Sci. USA 83:5602-5606 
(1986). /\g/Dbarterium-mediated transfomiation (Hinchee et al.. Biotechnology 6:915-921 
(1988)). direct gene transfer (Paszkowski et al.. EMBO J. 3:2717-2722 (1984)). and ballistic 
particle acceleration using devices available from Agracetus. Inc.. Madison. Wisconsin and 
Dupont, Inc., Wilmington. Delaware (see, for example, Sanford et al.. U.S. Patent 
4.945,050; and McCabe et al.. Biotechnology 6:923-926 (1988)). Also see. Weissinger et 
al., Annual Rev. Genet. 22:421-477 (1988); Sanford et al.. Particulate Science and 
Technology 5:27-37 91 987)(onion); Christou et al.. Plant Physiol. §Z:671-674 
(1988)(soybean): McCabe et al.. BioH-echnology 6:923-926 (1988)(soybean): Datta etal.. 
Bio/Technology 8:736-740 (1990)(rice); Klein etal.. Proc. Natl. Acad. Sci. USA. 
85:4305-4309 (1988)(maize); Klein et al.. Biotechnology 6:559-563 (1988)(mai2e); Klein et 
al.. Plant Physiol. 91:440-444 (1988)(maize); Fromm et al.. Bion-echnology 8:833-839 
(1 990); and Gordon-Kamm et al.. Plant Cell 2:603-618 (1 990)(maize); Svab et al. Proc. Natl. 
Acad. Sci. USA 8Z: 8526-8530 (1990) (tobacco chloroplast): Koziel et al. (Biotechnology U: 
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194-200 (1993)) (maize); Shimamoto etai Nature 338: 274-277 (1989) (rice); Christou etai 
Biotechnology g: 957-962 (1991) (rice); European Patent Application EP 0 332 581 
(orchardgrass and other Pooideae): Vasil et al. (Biotechnology H: 1 553-1 558 (1993) 
(Wheat); Weeks etai (Plant Physiol. 102: 1077-1084 (1993) (wheat). Pollen transformat.on 
(US Patent 5.629,183) is also used in the present invention. 

One particularly preferred set of embodiments for the introduction of recombinant DNA 
sequences into maize by microprojectile bombardment can be found in U.S. Serial No. 
08/008.374, herein incorporated by reference in its entirety. An additional prefen-ed 
embodiment is the protoplast transformation method for maize as disclosed in European 
Patent Application EP 0 292 435. as well as in U.S. Serial Number 08/024.875. hereby 
incorporated by reference in its entirety. 



EXAMPLES 

Example 1 : Construction of a FLP Expression Vector for Plants 

The FLP recombinase coding sequence is obtained from pOG44. a mammalian expression 
vector (Stratagene. LaJolla. CA). A vector containing only the coding sequence is created 
by ligating the 0.76 kb Hindlll-EcoRI and the 0.63 kb EcoRI-Kpnl fragments of pOG44 .nto 
the Kpnl-Hindlll sites of pLITMUS 29 (New England Biolabs. Beverly. MA). In order to place 
a Sg///site from a polylinker at the 3' end of the gene, a BamH/-SnaS/ fragment is inserted 
into the BamH/-EcoflVsites of pSGCJH4. An Arabidopsis thaliana polyubiquitin gene 
promoter {UBQ3. Norris et al. (1 993) PMB 21 :895-906) fusion to FLP recombinase is 
constmcted by ligation of the BamHI-Bglll fragment from the previous vector into the BamHI 
site of PCIB7813 (UBQ3 promoter and intron in the 5' nontranslated region with a BamHI 
site before the nos temiinator. This FLP recombinase plant expression vector is pAT152. 
In order to make a non-functional FLP recombinase as a control for random versus site- 
directed integration, the EooRI site in the FLP recombinase coding sequence is filled in with 
Klenow. creating a frame-shift mutation. The mutated recombinase is fused to the UBQ3 
promoter as above and is known as pATI 53. 
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Example 2: Construction of Vectors for Transient Assays Containing Wild-type and 
Modified Ff7r Sites 

A vector containing a 1.4kb stuffer fragment (neomycin resistance cassette) flanked by FRT 
sites inserted into the untranslated leader of a ludferase gene is constructed to test the 
excision efficiency of FLP reoomblnase on wild-type FRTsites in plants. The 35S leader 
and luciferase gene are excised from pPH121 as a BamHI-Xbal fragment and ligated in a 
three^ay ligation to a Spel-Hindlll nos tenninator fragment and BamHI-Hindlll cut 
pbluescript SK+. creating pAT155. A wild-type FRTsite with BamHI-Bglll sticky ends is 
assembled by annealing and kinasing two oligonucleotides: 5' -GAT ccg aag ttc cta 

TTC CGA AGT TCC TAT TCT CTA GAA AGT ATA GGA ACT TCG AAT TCC A-3 ' 

(SEQIDN0:7)and 5'-gat ctg gaa ttc gaa gtt cct ata ctt tct aga gaa 
TAG GAA CTT CGG AAT AGG AAC TTC G-3 ' (SEQ ID N0:8). The FflT Site is inserted 
before the 35S leader by ligation of the BamHI-Bglll double-stranded oligonucleotide into 
the BamHI site of pAT155 to create pATI 61 . The Arabidopsis UBQ3 promoter and leader 
from pPH1 21 is inserted as a BamHI fragment into the BamHI site of pAT16i . yielding 
pAT162. In order to disrupt luciferase expression, a Xda/ fragment from pNEObGAL 
(Stratagene. La Jolla. CA) containing a neomycin resistance cassette flanked by FflTsites 
is inserted into the Xba/site of pAT162, resulting in pAT163. The Xba/ sites are in the 
spacer region of the FflTsite and insertion of the pNEObGAL Xbal fragment into pAT162 
recreates a stuffer fragment flanked by single FRTsites in the same orientation. 
Vectors are also constmcted with combinations of wild-type and modified FRTsites (P and 
F5. Schlake and Bode (1994) Biochemistry 33: 12,746-12.751) flanking a 1.4kb stuffer 
fragment (neomycin resistance cassette) inserted into the untranslated leader of a 
luciferase gene in order to test the excisional activity of FLP recombinase on the FRTsites 
with modified spacers and the cross-reactivity between different FRTsites in the presence 
of active recombinase. The following FRTsites containing BamHI-Bglll sticky ends are 
assembled by annealing and kinasing two oligonucleotides: a P modified FRTsite: 5 ' - 

GAT CCG AAG TTC CTA TTC CGA AGT TCC TAT TCT TCA AAT AGT ATA GGA ACT 
TCA-3' (SEQ ID N0:9) and 5' -GAT CTG aag TTC CTA TAC TAT TTG AAG AAT AGG 
AAC TTC GGA ATA GGA ACT TCG-3 ' (SEQ ID NO:10). a F^ modified FRT Site: 5 ' -GAT 
CCG AAG TTC CTA TTC CGA AGT TCC TAT TCT TCA AAA GGT ATA GGA ACT 

TCA-3' (SEQIDN0:11)and5'-GAT ctg aag ttc cta tac ctt ttg aag aat 
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AOG AAC TTC GGA ATA GGA ACT TCG-3 ' (SEQ ID N0:1 2) and a Wild-type FflTs.te : 

T-oT. CCG AAO TTC CTA TTC CCA ACT TCC TAT TCT CTA OAA ACT ATA OCA 
ACT TCA-3' (SEQIDNO:13)and5'-GAT CTG AAG TTC CTA TAC TTT CTA GAG 
^T AGG AAC TTC GGA ATA GGA ACT TCG-3 ' (SEQ ID N0:14). A BamHI-Xhol 

fragment from pNEObGAL containing ti.e 3' part of the neomycin resistance ca^ettej^s 
subcloned into the respective sites of pLIT28. creating pAT184. The wild-type. F3 and F 
FRTsites were inserted into the BamHI site of pAT184, creating pAT189 (wild-type FRT 
stte). PAT187 (F3 FflFsite) and pAT186 (F5 FHTsite). PGR was perfom,ed to add a Bglll 
site to the 5- end of the neomycin resistance cassette from pNEObGAL using 
o,igonucleo«des5.-ACA cgg cgg cat cag ago ag-3' (SEQ.DNO:15)and5-TAA 
CAG ATC TCC ccc TGG CGA AAG G-3 ' (SEQ ID N0:16) and the product is doned 
directly into the pGEM-T Easy vector (Promega. Madison. Wi). creating pAT190. The Bglll 
site of PAT190 is used to insert the BamH/-Bg/// wild-type FflTsite (pAT199). the P FRT 
site (PAT197) and the P FflTsite (pATI 96). In order to insert the neomycin resistance 
cassette flanked by FRTsites in the untranslated leader of the luciferase gene, a vector 
with a unique BamHI site in the leader is created. pAT175 is constructed by a three-way 
ligation of an Xba/-BamH/ fragment of pClB7812 (Arabidopsis UBQ3 promoter and leader) 
and BamH/-EcoR/fragment from pAT155 (35S leader, luciferase gene and nos tem,.nator) 
into Xbal-EcoRI cut pUC19. Into this unique BamHI site of pAT175 is inserted: a BamHI- 
Xrto/fragment from pAT186 and a Xrto/-Bg/// fragment from pAT196 to create pAT208 (P 
FHTsites flanking neomycin resistance cassette), the respective fragments from pAT187 
and PAT197 into pAT175 to create pAT207 (P FRTsites flanking the stuffer cassette), the 
respective fragments from pAT189 and 196 into pAT175 to create pAT210 (wild-type and P 
FRTsites flan^ng the stuffer). the respective fragments from pAT189 and 197 into pAT175 
to create pAT21 1 (wild-type and P FRTsites flanking the stuffer fragment) and the 
respective fragments from pAT187 and 196 into pAT175 to create pAT213 (P and P FRT 
sites flanking the stuffer fragment). The FRTsites flanking the stuffer fragments in pAT206- 
21 3 are in the same orientation. 
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Example 3: Tobacco Transient Expression Experiments 

Tobacco transient assays are performed to test the excisional activity of FLP recombinase 
on the wild-type and modified FRTsites and the cross-reactivity between different FRT 
sites Tobacco leaf mesophyll protoplasts are transfomied by the polyethylene glycol- 
mediated method as described by Shillito and Saul (Plant Molecular Biology. A Practical 
Approach. C.H. Show eds. IRL Press. Chapter 7. pp. 161-186) with the following 
modifications. The heat shock treatment and carrier DNA addition steps are omitted. The 
protoplasts are resuspended at 1 x 10^ protoplasts/ml. 25 mg of each plasmid is added and 
PEG solution is added to a final concentration of 1 6.5%. The protoplast pellets are 
resuspended In 2.0 mis of K3A medium and incubated in the dark at 24-26 °C for 24-27 
hours. The protoplasts are diluted with 8 mis of W5. centrifuged and the pellets 
resuspended in 0.5 ml of lOOmM KPO4 (pH7.0). ImWI DTT. The protoplasts are lysed by 
perfomting three f reeze-thaw cycles of 3 minutes at -60°C followed by 3 minutes in a room 
temperature water bath. The tubes are centrifuged at 4 °C to remove the cell debris and 
luciferase assays are perfomied on the supematants using the Luclferase Assay System 
(Promega. Madison. Wl). Luminescence is measured for 10 seconds on a MonoKght 2010 
luminometer (Analytical Luminescence Laboratory, San Diego. CA). The results of the 
transient experiments are shown In the following Table. 







PAT1 63 (control) 


2913 




pAT152 + pAT163 


212971 


73X 


pAT206 (control) 


1372 




pAT152 + pAT206 


23345 


18X 


pAT207 (control) 1381 




pAT152 + pAT207 \ 52217 


38x 


DAT152 + DAT210 ! 5847 




pAT152 + pAT211 


6302 




|pAT152 + pAT213 I 3147 
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Example 4: Construction of a Recipient Vector Containing Wild-type and Modified 
FAT sites 

A complete 48 bp wild-type Fflfslte is assembled by annealing and klnasing two 
oligonucleotides (5' - cga agt tcc tat tcc gaa gtt cct att ctc tag aaa 

GTA TAG GAA CTT CG - 3 ' . SEQ ID N0:17) and (5 ' - AAT TCG AAG TTC CTA 
TAC TTT CTA GAG AAT AGG AAC TTC GGA ATA GGA ACT TCG GTA C - 3'. SEQ 

ID N0:18). In orderto place the FflTsite at the 3' end of a 35S terminator, the double- 
stranded oligonucleotide with 5' Kpnl and 3' EcoRI stidv ends is ligated into the KpnI-EcoRI 
sites of PGIB710 (Rothstein et al. (1987) Gene 53: 153-161). A t/BQ3 promoter luciferase 
gene fusion with the FRT site at the 3' end of the 35S temiinator is constmcted by first 
subcloning the terminator and FRT site as a BamHl-EcoRI band into the respective sites of 
pbluescript sk+ (Stratagene, LaJolla. OA), excising with Wof/-£cofl/. then ligating this 
fragment along with the UBQ3 promoter and lucHerase gene from pPH1 21 as a H/nd///-A/ot/ 
band into the Hindlll-EcoRI sites of pbluescript sk+. The UBQ3 promoter luciferase fusion 
with a wild-type FRTsite at the 3' end of a 35S terminator is inserted as a Ecofl/ fragment 
into PCIB730 (Rothstein et al.. 1987). a binary vector lacking a nptll gene, yielding pAT135. 
A 48 bp spacer mutant FflTsite (F=) is assembled by annealing and kinasing two 
oligonucleotides (5'-TCG acg aag ttc cta ttc cga agt tcc tat tct tca aaa 

GGT ATA GGA ACT TCG-3 ' . SEQ ID NO:19) and (5 ' -AAT TCG AAG TTC CTA TAC 
CTT TTG AAG AAT AGG AAC TTC GGA ATA GGA ACT TCG-3 SEQ ID NO:20). The 

mutant FRT- f^sHe is ctoned at the 3* end of a nos promoter, nptll gene, nos terminator 
gene cassette by ligating the double-stranded oligonucleotide with 5" Sail and 3' EcoRI 
stteky ends into the respective sites of a plasmid containing the entire nptll gene cassette 
from pCIBIO (Rothstein et al., 1987) excised as a Sac//-EcoR/ band and cloned into the 
respective sites of pbluescript sk+. The nos promoter nptll gene cassette flanked on the 3' 
end with the mutant FRT- site is excised with Sacl-Sall from this vector and ligated into 
the respective sites of pAT135. This final vector contains the nptll gene and luciferase 
gene cassettes in divergent orientation flanked by a wild-type and a mutant FRT- F^site at 
either end and is known as pAT136. 



wo 99/55851 



-28- 



PCT/EP99/02813 



Example 5: Plant Transformation and Selection of pAT136 Transformants 

Agrobacterium ft/mefedens strain A136 containing the helper plasmid pCIB542 is 
transfomied with pAT136. Leaf disc transformation of Nicotiana tabacum c.v. Petit Havana 
(SRI) (Maliga et al. (1973) Nature 244: 29-30) is performed essentially as described by 
Horsch et al. (1988) Plant Molecular Biology Manual A5: 1 -9. Kanamycin resistant tobacco 
transfomiants are screened for Iuciferase activity using a Iuciferase assay system kit 
(Promega. Madison. Wl). Genomic Southern analysis of Iuciferase expressing 
transformants is performed to detemiine copy number and v^hether the left border of the T- 
DNA is intact. Genomic DNA is digested with SamH/and Southern blots are probed with a 
nptll probe to determine copy number. The presence of a 1 .45 kb band in Hindlll-BamHI 
digests probed with nptll conf im^ed the presence of the left border. Single-copy. Iuciferase 
expressing pATI 36 tobacco transfomiants with intact left borders are identified and 
maintained in vitro. Clonal plants are sent to the greenhouse and crossed to wild-type SRI 
plants to maintain the hemizygous transgene. 

Example 6: Construction of a Donor Vector Containing Wild-type and Mutant FRT 
Sites 

A nos promoter-bar gene cassette flanked by a mutant FRT site (F^) on the 5' end and a 
wild-type FRT site on the 3' end is constnjcted by first subdoning the nos promoter and bar 
gene from pGPTV-BAR (Becker et al. (1992) PMB 20: 1195-1197) as a Hindlll-BamHI 
fragment and the 35S temiinator and wild-type FRT site as a BamH/-Ecofl/ band (used in 
the constmctton of pATI 36) into HIndlll-EcoRI cut pbluescrlpt SK+. A 48 bp mutant F» site 
is assembled by annealing and kinasing the two oligonucleotides (5' - tcg acg aag 

TTC CTA TTC CGA AGT TCC TAT TCT TCA AAT AGT ATA GGA ACT TCA - 3'. 
SEQIDNO:21)and(5'-AGC TTG AAG TTC CTA TAC TAT TTG AAG AAT AGG AAC 

TTC GGA ATA GGA ACT TCG - 3 ' . SEQ ID NO:22). The mutant F^ site is added to the 
5- end of the nos promoter by ligation of this double-stranded oligonucleotide containing 5' 
Sail and 3' Hindlll sticky ends into the respective sites of the previous vector. The Sail site 
at the 5' end of the F^ site is adaptored to Spel and the entire bar gene cassette flanked by 
the FflTsites is excised as a Spel fragment and inserted Into the Xbal site of a vector 
containing a UBQ3 promoter GUS gene with a F** mutant site at the 3' end of the nos 
temiinator (the UBQ3 promoter GUS gene and nos terminator from ppeh27 (GUS) as a 
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Xbal-EcoRlband and the F« site as an EcoR/-Sa// fragment into Xba/-Sa//cut pbluescript 
sk+). This final vector containing the bar gene and GUS genes in divergent orientation 
separated by a mutant site and f lanlced by a wild-type and a mutant site on either end 
is known as pAT158. 

Example?: Protoplast Transfornfwtion 

Protoplasts from young leaves of pAT136 transformants are transfonned by the 
polyethylene glycol {PEG)-mediated DNA transfomiation method as described by Shillito 
and Saul. Agarose-embedded protoplasts are put in a 1 :1 mixture of K3A medium and 
H460 medium with 5 mg/1 or 10 mgfl BASTA after 7 days, with weekly media changes as 
described. After 3 weeks on selection, individual colonies of at least 2mm are placed on 
solid LS medium containing 5 or 10mg/l BASTA. 

Example 8: FLP Recomblnase-medlated Integratten of a Transgene Into a Pre- 
introduced Target 

Equal numbers of protoplasts from a pAT1 36 tobacco transf omiant are cotransfonned with 
either pAT152 / pAT158 (site-directed recombination using FLP recombinase) or with 
PAT153 / PAT158 (control for random integrations using a non-functional recombinase). 
Recombinase-mediated integrattons are scored as GUS positive/ lucif erase negative events 
whereas an eventfrom a random integration express both the GUS and luciferase genes. 
Confimiation of recombinase-mediated integration events are verified by PGR using a 
primer from the region between the wild-type FflTsite and the right border and a primer 
from the GUS gene. Southern analysis is carried out to confimn such an event. 

Example 9: Construction of Positive Control and Reverse Positive Control Piasmids 
for Transient Expression Experiments 

A 48 bp spacer mutant FflT site (F^) (Schlake & Bode. Biochemistry 33, 12746-12751 
(1994)) is cloned as an oligonucleotide into the C/a/site of the intron within the leader of the 
Arabidopsis ubiquitin-3 gene promoter/leader plasmid pPEH28. a derivative of pSPORTI 
(BRL. Bethesda. MD) also comprising a nos terminator. The F' oligonucleotide top and 
bottom strands are shown below (mutant core region shown in bold; lower case nucleotides 
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represent the single-stranded portion of the oligo that fonns a Cial cohesive end; additional 
nucleotides which fonn an EooRI site are italicized): 

5'-cga tGA ATT CGA AGT TCC TAT TCC GAA GTT CCT ATT CTT CAIW ilAO GTA 

TAG GAA CTT c-3 ' (top Strand. SEQ ID NO:23) and 5 -CGG AAG TTC CTA TAC CTT 
TTG AAG AAT AGG AAC TTC GGA ATA GGA ACT TC-3' (bottom Strand. SEQ ID 

NO:24). , . 

After ligation of the oligo pair into the Cial site of pPEH28. a clone named 
UbqFSnos is identified with the site oriented toward the ubiquitin-3 promoter by mapping 
its asymmetric EcoRI site and its structure was confim^ed by DNA sequencing. Next, a 
1745 bp BamHI fragment of pAT134 containing a 35S leader sequence and the coding 
sequence of the firefly luciferase gene is ligated into the BamHI site of UbqFSnos. A clone 
with the luciferase coding region in correct orientation with respect to the UBQ3 promoter .s 
identified by mapping and named UBQ3F5Lucnos. 

in order to place a wild-type FRT site (F«) 5' to the UBQ3 promoter, an oligo pair 
containing the 48 bp sequence of the wild type FRT site. F". flanked by Xhol cohesive ends 
IS inserted into the Xhol site of UbqFSLucnos. The oligonucleotide top and bottom strand 
sequences are shown below with the cohesive Xhol end in lower case, an Aatll cut site, a 
(C) at the 5- end and a (G) at the 3> end to destroy the Xhol site, and the F° site In standard 
type with its wild type core region in bold type: 

5 '-teg a(C)G ACG TCG AAG TTC CTA TTC CGA AGT TCC TAT TCT CTA GAA 
AGT ATA GGA ACT TC (G) -3 ' (top Strand. SEQ ID NO:25) and 5 ' -TCG ACG AAG 
TTC CTA TAC TTT CTA GAG AAT AGG AAC TTC GGA ATA GGA ACT TCG ACG 

TC6-3 ' (bottom strand, SEQ ID NO:26). 

The orientation of F° is detemiined by mapping the asymmetric Aatll site and 
confimied by DNA sequencing. The two plasmids formed are called F0fwdUBQ3F5Luc 
(with the F° and F^ sites in parallel orientation) and F0revUBQ3F5Luc (F° and F^ sites .n 
opposite orientation). Removal of the promoter from this constmct by Asp718l digestion and 
replacing it with spacer DNA fomis a plasmid with substantially reduced but still significant 
lucifemse transient expression in plant cells. One possible cause of this minimal- promoter 
activity is a TATA-like sequence within the terminal region of the UBQ3-intron. The high AT 
region at the 3' end of the UBQ3 intron is removed by digestion with Cial and Ncol, and 
replaced by an oligonucleotide pair constructed to fit into the same sites (top strand and 
bottom strand are shown below, cohesive end in lower case) to iom a tmncated intron 



wo 99/55851 



-31 - 



PCT/EP99/02813 



structure that we designated as dint: 5 '-cat gGA cct gaa aca aaa caa t-3' (top 
strand.SEQlDNO:27)and5-CGA ttg ttt tgt ttc agg tc-3' (bottom strand. 
SEQIDNO:28). 

The resulting modified plasmids are named the Positive Control 
(F0UBQ3F5dlntLUC) and Reverse Positive Control (FOrevUBQSFSdIntLuc). 

Example 10: Construction of Recipient and Reverse Recipient Vectors 

The UBQ3 promoter is deleted from the Positive Control and Reverse Positive Contol 
plasmids by digestion with Asp 7181. and a self-complementary oligonucleotide (5'-gta 
cAG ATC T-3'. SEQ ID NO:29) is ligated into their cohesive ends, converting their 
ASP718I sites io Bglll sites. In the resulting plasmids. FOBgFSdIntLUC and 
FOrevBgFSdlntLuc. the Bglll site is opened and a 518bp Bgllll fragment of the bar gene 
from pAT 157 is ligated between the F« and F« sites, introducing "spacer- DNA and forming 
the Recipient (FOSpFSdIntLuc) and Reverse Recipient (FOReySpFSdIntLuc) vectors. 

Example 1 1 : Construction of Donor and Reverse Donor Vectors 

From the Positive qontrol and Reverse Positive Control vectors, EcoRI is employed to 
remove the nos temiinator. ludferase coding region. 35S leader and the 3' end of the 
tmncated intron. sparing the F^ site. This fomis the Donor (F0UBQ3F5) and Reverse Donor 
(F0revUBQ3F5) vectors. 

Example 12: FLP Mediated Cassette Exchange between Recipient and Donor Vectors 
of Examples 10 and 11 

For Biolistic delivery of DNA. BY2 tobacco suspension cells are spread onto filters on 
culture medium containing 12% sucrose (which acts as an osmoticum prior to 
bombardment). Plasmid DNA of the Reverse Recipient. Reverse Donor and Reverse 
Positive Control vectors is precipitated onto <1.0nm gold beads (Crescent Chemical 
Company. Hauppauge. NY) as described by Goff. S. et. al. {EMBO J 9. 2517-2522. 1990). 
The DNA concentration used for the Reverse Recipient, Reverse Donor and Reverse 
Positive Control vectors is O.lmg /6 shots and the DNA concentration used for the FLP 
expression vector (pAT152) Is 0.5 mg/6 shots. Each filter with BY2 cells is bomba«led one 
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«me using 650-1 .100 ps. rupture discs. Three repl^tes are pertor^ed lor each treatn^e 
1 or^e L beir,g one repHcate. Tissue is assayed for transient iuciterase express.n 
.16 hours post-bombardment. Cel. .ysates are obtained using the Lucrferase Assay 
system (Promega. Madron. W.). Luminescence is measured for 10 seconds on a 
lought 2010 .uminometer as described in Example 3. Cei. .ysates are measu^ fo 
soiuble protein using the Bio-Rad Protein Assay System (Bio-Rad. Her^ies. CA) and 
luciferase specific activity is reported as light units (LU) per ug soluble protein. 
The results of the transient experiment are presented in below: 

I iiriffti rase Sp ?ftifift ActivitV 

Vector 

Reverse Recipient . Reverse Donor 152 ± 33 LU/ug protem 

Reverse Recipient . Reverse Donor . pAT 152 467 ± 105 LU/ug protein 

48,734 ± 9,658 LU/ug protein 

Positive Control 

Example .3: ConsTucMoa o< »> «P«/ Exp«»to. VecU»r wHh . Modllted mr Sitt m 
the Intron of the Gene Coding Region 

The nos promoter and nptll gene are excised from pGPTV-Kan (Becker. D. et a. Plant 

Molecular Biology 20: 1195-1197.1992) as a 1403 bp BamH/- C/a/ 

into the BamHI - Cal srtes 5' of the Act2 tem^inator (An et al. (1996) Plant J. 10. 107- ) 

forming an NPTIl expression cassette. NosNPTact. For insertion of ^^J^^-^^J^^h, 
nos promoter and the entire npf gene coding sequence is Cloned asa918bpN/,e,-BamH, 

fragment into pUCI 8 Coning vector (NEB. Beveriy. MA) to which is first added an srte 
by oligonucleotide insertion. An intron from Solanum tuberosum (Eckes. P.. Rosahl S. 
Schell J. Willmitzer L (1986) Mol Gen Genet 199:216-224) is PGR ampWied. cloned ,nto 
PCR21-TOP02 vector (Topo2 TA Cloning Kit. In Vltrogen). excised precisely as a 
SnaBl/Pvul (blunt) fragment and cloned into the BsaA/ site of the npt gene coding 
sequence. A done with the correct orientation of the intron Is identified by mapping and 
confimied by DNA sequencing. An oligonucleotide pair comprising a modified FflTsite (F . 

F' F^or F^ Schlake & Bode. Biochemistry 33. 12746-12751 (1994)) is introduced into 
thl cohesh^e ends of the Mfa/ site of the intron of npf//. utili^ng one of the synthe.c 
oligonucleotides depicted below, also carrying the AATT cohesive end). A clone with the 
FflT site in the correct orientation is identified by mapping and confirmed by sequencing. 
For example, for the F' Core Sequence, the following oligonucleotides are used (Nort and 
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EcoW sites ttalized. core sequence in bold): 5 -aat toe ggc cgc gaa gtt cct att 

CCG AAG rVC CTA TTC TCT AGA TAG TAT AGG AAC TTC GAA TTC-3 ' (tOp Strand. 
SEQIDNO:30)and5--AAT TGA ATT CGA AGT TCC TAT ACT ATC TAG AGA ATA 
GGA ACT TCG GAA TAG GAA CTT CGC GGC CGC-3 • (bottom Strand. SEQ ID N0:31). 

Oligonudeotides for through are identical to except in the FRT Site Core 
Sequence shown below: 

Oli gonucleotide Core Sequence 

pi 5 ' _TCTAGATA-3 ' (SEQ ID N0:6) 

p2 5 < -TCTACTTA-3 ' (SEQ ID NO:4) 

p3 5 ' -TTCAAATA-3 ' (SEQ ID N0:3) 

p4 5 ' -TCTAGAAG-3 ' (SEQ ID N0:5) 

pS 5 ' -TTCAAAAG-3 ' (SEQ ID NO:2) 

To reconstruct the original expression cassette, the nptll gene, now containing an intron 
with a modified FRT site, is excised as a 1 171 bp Nhel - BamHI fragment and re-ligated 
into the same sites of the original expression vector. From this cassette, the 3' untranslated 
region of Actin-2 is removed by BamHI - Sad digestion and replaced by a 343 bp Bglll - 
sad fragment containing the PAL 3' untranslated region (Wanner. LA et. aL. Plant Mot. 
Biol. 27 (2). 327-338 (1995). This produces a family of clones whose members are named 
NosNptlntFxPal, where x is 1 . 2. 3, 4 or 5 as defined above. 

For deliveiy into plant cells using Agrobaderium tumefadens. the desired 
NosNptlntFxPal is cloned into a binary vector, such a binary vector well-known in the art. as 
a 2073 bp Sad - Xhol fragment and the resulting plasmid is transfomied into a helper 
Agrobaderium strain suitable for the plant host to be transfomied. for example LBA4404 or 
A136(pCIB542)orMP90, 

Example 1 4: Construction of a FLP Expression Cassette 

The FLP gene coding sequence is cloned from pAT152, described in Example 1. into 
PUC18 as a 1395 bp Asp718l - BamHI fragment. The S. tuberosum intron described .n 
Example 13 is introduced into the EcoRV site of the FLP coding sequence and a done with 
the correct orientation is identified by mapping and confimied by sequencing. The Asp718l 
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Site is converted into a BamHI site and the FLP coding sequence, now containing an intron. 
is cloned as a 1589 bp BamHI fragment into the BamHI site between the Arabidopsis Actin- 
2 promoter and terminator (An et al. (1996) Plant J. 10: 107-121). A clone with the correct 
orientation of FLP in the expression cassette is identified by mapping and designated 
ActlntFLP. 

For delivery into plant cells using Agrobacterium tumefaclens. a 3665 bp Sad - Xhol 
fragment of ActlntFLP is cloned into a binary vector, and the resulting piasmid is 
transfomied into a helper Agrobacterium strain suitable for the plant host to be transfom^ed. 
for example LBA4404 or MP90. 

Example 15: Construction of an Excisable FLP Expression Cassette 

Integration of the FLP expression cassette into the plant genome is not needed and indeed 
not desirable during the process of site-specific integration. A way to reverse any such 
integration is to flank the cassette by parallel matching FRT sites. Integration of such a 
construct is reversed by FLP recombinase. which efficiently catalyzes excision of DNA 
between parallel matching FRT sites. 

Piasmid ActlntFLP (Example 14) is digested by Asp718l and Apal and a family of 
oligonucleotide pairs are individually ligated into these sites forming a FRT site that 
corresponds to F" or F'- F^. The oligonucleotide pair is indicated below. (Top strand of F Is 
shown; bottom strand is complementary except for the cohesive ends shown in lower case 
typescript at each end of the top strand. The sequence of eight N's in the center represents 
any desired core sequence. TCTAGAAA for F° or the various F'- F^ core sequences in 
example 13. The added [C] residue near the 5'-end is included to spare the Asp7J8/ site for 
future use, the Asp7181 and Apal cohesive ends are shown in lower case: 5'-gta 

c[C]G AAG TTC CTA TTC CGA ACT TCC TAT TCN NNN NUN NOT ATA GGA ACT 
TCg gcc-3' (top Strand. SEQ ID NO:32) and 5 • -G AAG TTC CTA TAC NNN NNN NNG 
AAT AGG AAC TTC GGA ATA GGA ACT TCG-3 ' (bOttom Strand. SEQ ID NO:33). 

The resulting product structure is identified by survival of an Asp718l site and loss of 
the Apal site, and confirmed by sequencing. The family of Intemiediates is designated 
FxActlntFlp, and is next digested with Sad and NotHo introduce the second (matching) F" 
site as an oligonucleotide pair shown below. Both strands of the oligonucleotides are 
shown for clarity: 5' -GAA gtt cct att ccg aag ttc cta ttc nnn nnn nng tat 
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AGG AAC TTC lGC]-3' (top Strand. SEQ ID NO:34) and 5 ' -teg aCT tca agg ata 

AGG CTT CAA GGA TAA GNN NNN NNN CAT ATC CTT GAA GICG] ccg g-3' 

(bottom strand. SEQ ID NO:35). The Ns represents a FflT core sequence that matches the 
F" core sequence at the other end of the same clone. Flanking sequence in lower case 
represents the cohesive ends for Sad (3'end of bottom strand) and NotI (5'end of bottom 
strand). The [GC] sequence at the right end is added in order to preserve the NotI site for 
future use. A clone with the correct oligo insert is identified by loss of the Sac/ site and 
confimied by DNA sequencing. The resulting excisable donor family is temied: 
FxActlntFlpFx. 

For delivery into plant cells using Agrobacterium tumefaciens. an Asp718l-Notl 
fragment of FxActlntFLPFx is cloned into a binary vector between Bsp120l and SacI sites 
and the resulting plasmid is transformed into a helper Agmbacterium strain suitable for the 
plant host to be transformed, for example LBA4404 or MP90. 

Example 16: Construction of a Family of Recipient Vectors Containing a far Gene 
Expression Cassette, a Partial npW Gene Expression Cassette and Wild- 
type and IModified FflT Sites 

Afamily of recipient vectors is constmcted with a bar gene expression cassette flanked by a 
wild-type and a modified FflT site with the sites either in parallel or opposite orientation 
relative to each other. Outside of the region flanked by the two FflT sites is the DNA 
sequence in the nptll expression vector described in Example 13 which is 3' of the modified 
FflT site. 

The family of recipient vectors is constructed starting from pN0V111 containing the 
UBQ3lntBARnos expression cassette. The RsrII and Xhol sites of pNOV 111 are changed 
by oligo converters into Hindlll and BamHI sites respectively. The cassette is then excised 
as a 2605 bp Hindlll - BamHI fragment into pBluescriptSK (Stratagene, Beverly, MA), 
fomiing pBS.UBQSInffiARnos. The wild-type FflT site. F°. is ligated into the Xhol site of 
pBS.UBQSIntBARnos as an oligo pair with flanking Xhol cohesive ends (top strand : 5 ' - 

teg aGA AGT TCC TAT TCC GAA GTT CCT ATT CTC TAG AAA GTA TAG GAA CTT 

CAC GCG T-3 ' . SEQ ID NO:36. core sequence in bold and Mlul site italized. and bottom 
strand: 5--TCG aac gcg tga agt tcc tat act ttc tag aga ata gga act 
TCG GAA TAG GAA CTT c-3', SEQ ID NO:37) and clones with F° in either orientation 
are Wentified by mapping the asymmetric Mlul site. The products are designated 
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UBQSIntBARnosFOfwd for the orientation of the top strand opposite that of the bar gene 
and UBQ3lntBARnosF0rev for the orientation parallel to bar. 

The tenninal half of the nptll gene cassette containing the modified FRT site is 
excised from NosNpttntFxPal (described in Example 13) as a 840 bp NotI - Sad fragment 
and is ligated into the Not, - Sac, sites of UBQSIntBARnosFOfwd or UBQ3lntBARnosF0rev 
to fom, two Recipient Vector families. F0fwdUBQ3lntBARnosFx[3'lntp'NptlPal and 
F0revUBQ3lntBARnosFxt3'lnt][3'NptlPal. (The brackets enclosing 3Mnt and 3'Npt are to 
indicate that only the 3'-temiinal part of each of these stmctures is present in the plasmid. 
A promoter, the missing part of the intron and the missing portion of Npt v«ll be restored by 
site-specific integration where the recombinase is FLP. F" and F^ sites are oriented parallel 
to one another in F°fwd Recipient and oppositely to one another m F°rev Recipient.) In 
different family members, the modified FRT site FMs either F\ F^ F'. F*or F^ 

For delivenr into plant ceHs using AgrobaCerium. the 3546 bp Sac/-Mu/ fragment 
(F«rev Recipient) or Sac/-X/>o/ fragment (F°fwd Recipient) is cloned into a binary vector at its 
Sad- Miu, sttes or Sad-Sa// sites respectively. The resulting plasmlds are transfomned into 
a helper Agrobacterium strain suitable for the plant host to be transfomied. for example 
LBA4404 or A136(pCIB542) or MP90. 

Example 17: ConsUuction of a Family of Recipient Ve^t°^%«^°"f i^"^ fJ^ 
Expression Cassette, a FLP Expression Cassette a Partial nptll Gene 
Expression Cassette and Wild-type and Modified F/?T Sites 

A family of recipient vectors is constmcted that is identical to those of Example 16, except 
insetted between F^ and the UBQ3 promoter of the bar gene expression is a FLP gene 
expression cassette with an intron in the FLP coding sequence oriented in the same 
direction as bar. 

This family of recipient vectors is constnicted by ligating the FLP gene expression 
cassette of Example 14 as a 3581 bp Xbal fragment into the Xba, site of 
pBS.UBQ3lntBARnos described in Example 16. An intemnediate is identified with the bar 
and FLP expression cassettes both oriented in the same direction and is called 
ActFLPIntTactUBQ3lntBARnos. The F^ site and the terminal half of the npti, gene cassette 
containing the modified FRT site are inserted into ActFLPIntTactUBQ3lntBARnos in 
separate steps exactly as described above in Example 16, fomiing the family of recipient 
vectors FOfwdUBQ3lntBARnosActFLPlntTactFx[3'lnt][3'NptlPal and 
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F0revUBQ3IntBARno8ActFLPIntTactFxl3MH3W^^ The modified FflT site F«is either 
pi p2 p3 F^orF* 

' ' For delivery into plant cells using Agmbacfericm. the 7127 bp Sac/-M/u/ fragment of 
the FLP-contalning F»rev Recipient Vectors and the corresponding 7127 bp SaC-Xho, 
fragment of the F«fwd Recipient Vectors are cloned into a binary vector 
at its Saci- MM sites and Sac/-Sa// sites respectively. The resulting plasm.ds are 
transformed into a helper /^grcl,ac^enum strain suitable for the pl^^^ 

for example LBA4404 or A136(pCIB542) or MP90. 

Example 18: Construction of a Family of Donor ^^f-^^^^f^^'^'' 
wfth the Recipient Families of Examples 16 and 17 

A family of donor vectors is constructed for restoration of nptll gene expression by site- 
specific integration with a recipient vector which has corresponding FRTsites. 

The oligonucleotide pair comprising the v^ld-type FflTsite. F". descdbed in Example 
16 above, is ligated into the Xho/ site of pBluescript forming pBS.FOfwd and pBS.FOrev^ 
The orientation of the F° site is determined by mapping using its asymmetric M/u/ site and 
confirmed by sequencing. The F^fwd construct has the F« core arrow (Example 16) in the 
direction of Kpn, to Sad of the poiylinker. and the F«rev constnict has the opposite 
orientation. The 1272 bp C/a/-EcoR/ fragment of NosNptlntFxPal (Example 13) conta,n,ng 
the r,os promoter, nptll gene coding sequence, and part of the intron up to and including the 
modified FRT site is ligated into the corresponding sites of pBS.FOfwd and pBS.FOrev. 
fomiing two donor vector families. F0fwdNosl5'Npt][5'lnt]Fx and F0revNosl5'Npt]t5-|nt]Fx^ 
(The brackets enclosing 5'lnt and 5'Npt are to indicate that only the 5'-proximal part of each 
of these stn^ctures is present In the plasmid. These are precisely the components 
necessary to complete the corresponding 3--tem,inal parts in the recipient vectors of 
Example 16. in order to fom, an intact nptf/ coding sequence.) The modified Ff?Tsrte site F 
is either F' F^ F^ F*orF^. As for the Recipient families, the Donor family with F^fwd has P 
and r sites in parallel orientation while the family with F^rev has F'' and F« sites oppositely 
oriented. 
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Example 19: Construction of an Excisable Donor Vector for site-specific Integration 

When donor DNA is delivered by Agrobacterium, Initially it is in single-stranded linear form, 
and therefore presumably not a proper substrate for site-specific integration. In order to 
facilitate production of a double-stranded circular fomi of donor DNA. a second F° site is 
introduced into the donor construct that allows FLP-mediated circularization of the donor 

construct between parallel F° sites. 

A wild-type F° site is ligated as an oligonucleotide pair into the Xmal site adjacent to 
the modified FHT site in the two families of donors described in Example 18. The F° 
oligonucleotide is shown below (top strand: 5 '-ccg gGA agt tcc TAT TCC GAA GTT 

CCT ATT CTC TAG AAA GTA TAG GAA CTT CGA GCT C(T) -3 ' . SEQ ID NO:38. F° 

Core Sequence in bold, and the bottom strand: 5 --CCG gag agc tcg aag ttc cta 

TAC TTT CTA GAG AAT AGG AAC TTC GGA ATA GGA ACT TC-3'. SEQ ID NO:39. 

Which also bears an Xmal cohesive end but is othenwise complementary; the extra (T) 
residue outside the Sad site in the oligo is added in order to Inactivate the Xmal site on one 
end of the insert. 

After insertion of the oligo pair Into the F°fwd donor family, clones with the two P 
sites in parallel orientation are Identified by mapping the asymmetric Sad site and 
confim^ed by sequencing. This fom,s the new donor family F0fwdNos[5'Npt][5'lnt]FxF0fwd. 
Similarly, from the F«rev donor family a new F0revNos[5'NptK5'lntlFxF0rev donor family .s 
identified. 

For delivery Into plant cells using Agrobacterium. the F«fwd F° donor family is 
subdoned as a Sac/-Asp7f 8/ fragment into the corresponding sites in a binary vector. The 
F°rev F» donor family is subdoned as an Xmal-Asp718l fragment into the corresponding 
sites In a binary vector. The resulting plasmlds are transfomned into a helper Agrobacterium 
strain suitable for the plant host to be transfomied. for example LBA4404 or A136(pCIB542) 
orMPgo. 

Example 20: Plant Transformation and Selection of BASTA Resistant F°fwd and P'-rev 
Recipient Family Transformants 

Agrobacterium tumefadens strain MP90 Is transfom^ed with the binary vector containing 
F0fwdUBQ3BARnosFxl3'lnt][3'Npt]Pal or F0revUBQ3BARnosFx[3Mntp'Npt]Pal (Example 
16). Transfomiation of Nicotiar^a tabacum suspension ceil line. BY2. with the above 
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^**ms».s IS perto^d e«en«a«, as dsscnbed b, An. G. al (Plan, Molecular 
Blolcw Manual A3: 1-19. Gelvin. S.B. and Schllpe^ort. R.A. Eds. Klu«er Acad.™ 
Pub«she,s, Do«l«oh.«oston/London, 1987). BASTA rasisutn. tobacco uansfomiants a« 
satected and subcultured on MS madia supplamantad v.ith 02 2.4-0. 5-10 rngfl BASTA 

and 500mg/l carbeniciilin. 

Cassette 

Agrobacterium tumefadens strain MP90 is transformed with a bina^r vector containing 
F0fwdUBQ3BARnosActFLPTactFxl3'lntlI3'Npt]Pal °'' 
F0revUBQ3BARnosActFLPTactFxl3'lntll3'Npt]Pal (Example 17). Transfom.at.on of 
Nicoma tabacum suspension cell line. BY2. with the above Agrobacterium strains .s 
perfomied essentially as described by An. G. et al (Plant Molecular Biology Manual A3: 1- 
19 Gelvin. S.B. and Schilperoort. R-A. Eds. Kluwer Academia Publishers. 
Do,drechVBostonA.ondon. 1987). BASTA resistant tobacco transformants are selected and 
subcultured on MS media supplemented with 0.2 mg/1 2.4-D. 5-1 0 mg/1 BASTA and SOOmg/l 
carbenidllin. Transient assays are conducted to measure the relative level of FLP gene 
expression in recipient vector transformants as described in Example 12. Plasmid DNA of 
PAT163. described In Example 2. is used at a concentration of 1.0^g per 3 shots and 
bombarded with one shot into one replicate of each recipient vector transfomiant. 
Luciferase specific activity is determined as described in Example 12 and represents the 
relative level of FLP recombinase activity. 

Example 22: FLP Mediated Cassette Exchange between Donor Vectors and BASTA 
Resistant Recipient Transformants 

Tobacco transformants containing recipient vector DNA of Example 20 undergo site-spedfic 
integration with donor vector DNA (Examples 18 and 19) in the presence of FLP 
recombinase forming kanamycin resistant transformants. The FLP gene is introduced into 
the recipient transformant tobacco cells (BASTA", Kan^) as plasmid DNA of ActlntFLP or 
FxActmtFLPFx (Examples 14 and 15) via microprojectile particle bombardment Plasmid 
DNA of donor vectors F0fwdNos[5'lntlI5'Npt]Fx. F0revNos[5'lnt](5'NptlFx. 
F0fwdNosl51nt][5'Npt]FxF0fwd. or F0revNos[5'lntll5-Npt]FxF0rev is co-precipitated onto 
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gold beads with ActlntFLP or FxActlntFLPFx and co-bomba«led into the transformed cells 
using the DNA precipitation and bombardment procedures described in Example 12. The 
core sequence of the ^ sites and the orientation of the F^ and ^ srtes are identical in the 
recipient transformant and donor DNA to allow for site-specific integration. In the presence 
of FLP recombinase. the DNA flanked by F° and F« in the donor vector undergoes FLP 
mediated cassette exchange with the DNA flanked by the same sites In the recipient 
transformants. giving rise to kanamycin resistant sfte-spedfic Integration transfomiants 
selected on MS medium supplemented with 0.2mg/l 2.4-D. 100mg/l kanamycin and 500mg/I 
carbenicillin. PGR analysis of the kanamycin resistant site-specific integration 
transfomiants further substantiates that the donor DNA is now integrated between the F° 
and F- sites of a prevkjusly Integrated recipient vector, forming an intact nptll gene 
expression cassette as described in Example 18. If the recipient transfomiant contains only 
one copy of the recipient vector DNA. then the ter gene cassette is excised during site- 
specific integration with the donor vector, fomting BASTA^ tobacco cell lines. The recipient 
transfomiants are therefore, thoroughly washed prior to bombardment to eliminate any 
residual BASTA. 

in a separate transfomiatton experiment, the FLP gene is introduced into the 
recipient transfom,ant tobacco cells (BASTA''. Kan^) as T-DNA from A. tumefaciens strain 
MP90. transfomied with a binary vector containing ActlntFLP or FxActlntFLPFx (Examples 
14 and 15). The donor DNA (Example 19) is introduced into the tobacco cell 
simultaneously as a T-DNA from a different A. tumefaciens strain MP90. transfomied with a 
binary vector containing F0fwdNos[5'lnt][5'Npt]FxF0fwd or F0revNos[5Mntl[5'NptlFxF0rev. 
The recipient transformants are thoroughly washed prior to co-cultivation with the above A. 
tumefaciens strains to eliminate any residual BASTA and carijeniclllin. The core sequence 
of the F^ sites and the orientation of all F° and T sites are Identical In the recipient 
transformant and double-stranded circular fomi of donor DNA (described In Example 19) to 
allow for site-specific integration. Kanamycin resistant site-specific integration 
transfomiants are selected and PGR analysis is perfomied as described in the paragraph 
above. 
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Tobacco transformants containing redpient vector DNA of ^^^^^^ 

recomblnase undergo site-spedf,c integration w«h donor vector DNA(Exampies18and1^ 

fomiing kanamycin resistant transfomiants. The donor vector is introduced into the 
recipient transfomnant tobacco cells (aP+.BASTA". Kan^" ) as plasmid DNA of 
F0f,^«lNosl5•|nt]l5•Npt]Fx. F0revNos[5-lnt]l5-Npt]Fx. FOfwdNoslS'lntKS'NptlFxFOfwd. or 
F0revNos[5'lnt]l5'Npt]FxF0rev via microprojectile particle bombardment following 
procedures described in Example 12. The core sequence of the ^ sites and the orientaUon 
of the F° and ^ sites are identical in the recipient transtomiant and donor DNA to altowfor 
site-spedfic integration. In the presence of FLP recombinase. the DNA flanked by F» and F 
in tt,e donor vector undergoes FLP mediated cassette exd^ange with the DNA flanked by 
the same sites in the recipient transfom,ants. giving rise to kanamycin resistant site.spe«f.c 
integration transformants selected on MS medium supplemented with 0.2mg/l 2.4-D. 
100mg/l kanamycin and 500mg/l carbenldllin. PGR analysis of the kanamydn resistant 
site-spedfic integration transfom^ants further substantiates that the donor DNA is now 
integrated between ti.e F« and F« sites of a previously integrated redpient vector, fom^rng an 
intact nptll gene expression cassette as described in Example 18. if the reap.ent • 
transformant contains only one copy of the redpient vector DNA. then the bar and FLP 
gene cassettes are excised during site-specifte Integration with the donor vector. fom.ng 
BASTA^ and FLP(-) tobacco cell lines. The recipient transfomiants are therefore thoroughly 
washed prior to bombardment to eliminate any residual BASTA. 

,n a separate transfomiatlon experiment, the donor DNA (Example 19) is introduced 
into the recipient transformants expressing FLP (FLP..BASTA". Kan^' ) as a T-DNA from A. 
tumefadens strain MP90. transfomied mu a binary vector contain.ng 
FOfwdNoslSWNptlFxFOfwd or F0revNosl5'lntK5'Npt]FxF0rev. The recipient 
transfomiants are thoroughly washed prior to co-cultivation with ti.e above A tumefaaens 
strains to eliminate any residual BASTA and carbenicillin. The core sequence of the F sites 
and the orientation of all F» and F« sites are identical in the redpient transfomiant and 
double-stranded drcular form of donor DNA (described in Example 19) to allow for site- 
spedfic integration. Kanamydn resistant site-spedfic integration transformants are selected 
and PGR analysis is performed as described in ttie paragraph above. 
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What Is ClalnKd Is. 

1 . A method for site-specific modification of a plant genome comprising: 

a) obtaining a plant cell stably transfomied with a first DNA sequence comprising two 
recombinase recognition sites which differ from one another In their spacer; 

b) introducing into said plant cell a second DNA sequence comprising the two 
recombinase recognition sites corresponding to the recombinase recognition sites of 
the first DNA sequence, in the presence of active recombinase(s); 

c) identifying a recomblnase-mediated integration of the second DNA sequence at the 
chromosomal location of ttie first DNA sequence. 

2. The method of dalm 1 . wherein said modification comprises the addition of a nucleotide 
sequence at the chromosomal location of the first DNA sequence. 

3. The method of dalm 1 , wherein said modification comprises the replacement of a 
nucleotide sequence at the chromosomal location of the first DNA sequence by another 
nucleotide sequence at the same chromosomal location. 

4. The method of claim 1 , wherein said modification comprises the removal of a 
nucleotide sequence at the chromosomal location of the first DNA sequence. 

5. The method of claim 1 , wherein said active recombinase is transiently expressed in 
said plant cell. 

6. The method of claim 5. wherein said active recombinase is provided to said plant cell 
as a messenger RNA molecule, wherein said messenger RNA molecule is translatable 
in said plant cell. 

7. The method of daim 5. wherein said active recombinase is provided to said plant cell 
as active protein. 

8. The method of claim 1 . wherein said plant cell Is stably transformed with a DNA 
sequence comprising the coding region of at least one recombinase, wherein said 
coding region is expressible in said plant cell. 
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9. The method of claim 1. wherein the two recombinase recognition sites are derived from 
two different site-specific recombination systems. 

10. The method of claim 1. wherein the two recombinase recognition sites are derived from 
the same site-specific recombination system. 

1 1 . The method of claim 10. wherein the site-specific recombination system Is selected 
from a group consisting of R/RS. GIn/g/xand FLP/FflT. 

12. The method of claim 11. wherein the site-specific recombination system is FLP/FflT. 

13. The method of daim 12. wherein the spacers are selected from a group consisting of 
SEQ ID N0:1-6. 

14 The method of claim 1. wherein the fir^t DNA sequence further comprises outside of 
the DNA stretch comprised between the two recombinase recognition sites the cod.ng 
region of a selectable maker gene or a portion thereof, wherein the 5- end of said 
coding region is adjacent to one of the recombinase recognition sites, wherein said 
selectable marker gene is potentially expressible in said plant cell but not expressed. 

15. The method of claim 14. wherein the 3' end of said coding region Is operably linked to 
termination signals. 

16 The method of claim 14. wherein the first DNA sequence further comprises between 
the two recombinase recognition sites an expression cassette comprising a gene of 
interest expressible in said plant cell. 

17. The method of claim 1 4. wherein the first DNA sequence further comprises between 
the two recombinase recognition sites an expression cassette comprising a visible 
mariner gene expressible in said plant cell. 
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,a The method of clelm 14, «here.n the f»e. ONA sequence lu*er oompHses bet»ee„ 

■ the two «cor*.nase ^cosniUoo s«es an expression oessene «»np.is,n9 a nejatwe 
selecteble maiker gene expressible in said plant cell. 

,9 The method of Calm f 4, »he,eln the second DNA sequence fur«», con,pdsest«««en 

■ „e two recombinase recognWon sites a pn.mo.er capable of dlrectln, the expression of 
a gene In a plan, cell, wherein said pronator Is adiacen, to one of the recosn^on s^ 
and IS oriented such that I. Is capable of directing transcription towards the ou»le of 
the DNA stretch comprised between the two recombinase recognition sHes, where« a 
functional fusion between said coding region of the first DNA sequence and saKi 
promoter Is created upon recombinaseHnedlated Integration of the second DNA 
sequence at the chiomosomal location of the first DNA sequence. 

20 The method of dalm 14, wherein the second DNA sequence further comprises between 
■ «,e two recombinase recognition sites an expression cassene comprising a gene ol 

interest expressible in said plant cell. 

2, The method 0. cUlm 14, wherein me second DNA sequence further comprises between 
' the m> recombhase mcognWon sites an expression casseUe compnsing a .rsrble 
marker gene expressible in said plant cell. 

22 The method of claim 1 4. wherein the second DNA sequence further comprises be^.ee^ 
■ the two recombinase recognition sites one or more additional recombinase recognrtion 
sites different from the other two recombinase recognition sites in said DNA sequence 
and the coding sequence of a selectable marker gene, wherein the 5' end of said 
coding region is adjacent to one of the additional recombinase recognition sites, 
wherein said selectable marker gene is expressible in said plant cell. 

23. The method of daim14.wherein the 3- end of said coding region is operably linked to 

termination signals. 

24 The method of claim 1 . wherein the first DNA sequence further comprises between the 
■ two recombinase recognition sites an expression cassette comprising a selectable 
marker gene e)qpressible in said plant cell. 
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25 The memod 01 claim 24. wherein me firs, DNA seq-ence ,u,mer compHsee 

■ me two reooThbinase recogniUon sites eh expressloh cassette comprtslhg a gene o, 
interest expressible in said plant cell. 

26 Thernethodofclaim24,whereinthefirstDNAsequenoefurthercornprisesbetwe^^^ 

■ the two recombinase recognition sftes an expression cassette comprising a negat.ve 
selectable marker gene expressible in said plant cell. 

27 The method of claim 24. wherein the first DNA sequence further comprises between 

■ the two recombinase recognition sites an expression cassette comprising a visible 
maricer gene expressible in said plant cell. 

28 The method of dalm 24. wheraln the second DNA sequence further comphsss between 

■ the ««. recombinase recognition s«ee an expression casseUe comprising a selectable 
mariner gene expressible in said plant cell. 

29 The method of dalm 24, wherein the second ONA sequence further comprises between 
the two recombinase recognition sites an expression casseUe composing a gene of 
interest expressible in said plant cell. 

30 The method of ..aim 24. wherein the second DNA sequence further compdses bet«een 
the two recombinase recognition sites an expression cassette comprtsUig a «,s*le 

marker gene expressible in said plant cell. 

31 The method of claim 24. wherein the second DNA sequence further comprises between 
■ the two recombinase recognition sites one or more additional recombinase rec«gnrt.on 

sites different from the two other recombinase recognition sites of said DNA sequence. 

32. A plant cell stably transfomied with a DNA sequence comprising two different 
recombinase recognition sites. 

33 The plant cell of claim 32. further comprising outside of the two recombinase 

recognition sites the coding region of a selectable maker gene, wherein the 5' end of 



wo 99/55851 



-46- 



PCT/EP99/02813 



saM a3*HI is adlaoen. » «>e ^ me recognise s«es, herein said 

se,eo«ble««-k«flS.e is p<««-y.»press«eU, saw P«..ce«bu, no, expressed. 

34. -n-e plan. ca. o. data 33, «hareln «,e 3' end ol said ceding region is operably linked u> 

termination signals. 

35. The plan, cell o. claim 33. .u*er compdslng be.«een «,e Wo recomblnase recognMon 
sltts an expression casseue con^Hslng a visible ma*er gene axpresslMe ,n sa,d plan, 
cell. 

36 The plant ceH of claim 33. further comp.isir,g betweer, the two recomblnase recognitor, 

■ sites ar, expression cassette comprising a selectable marker gene expressible in said 
plant ceil. 

37 The plant cell of claim 33. further comprising between the two recombin.se recognition 

■ sites an exp-^ssion cassette comprising a negative selectable marker gene express.ble 
in said plant cell. 

38. A plant regenerated from the plant cell of claim 32. 

39. A plant obtained by the method of claim 1 . 
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SBOOENCE lilSriNS 

<110> Novartis AG 

<120> Site-directed transfoncation of plants 

<130> S-30481/A/CGC 2000 

<140> 

<141> > 

<150> US 09/067552 
<151> 1998-04-28 

<160> 39 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 8 
<212> ENA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: vdld-type 
spacer of FRT site 

<400> 1 
tctagaaa 

<210> 2 
<211> 8 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutant spacer 
of FB^ site 

<400> 2 
ttcaaaag 

<210> 3 
<211> 8 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutant spacer 
of FRT site 

<400> 3 
ttcaaata 



<210> 4 
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<211> 8 
<212> Wk 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Seqamce: mutant ^cer 
of ERT site 

<400> 4 3 
tctactta 

<210> 5 
<211> 8 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutant spacer 
of FKT site 

<400> 5 3 
tctagaag 

<210> 6 
<211> 8 
<212> im 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mutant spacer 
of FRT site 

<400> 6 3 
tctagata 

<210> 7 
<211> 61 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

gatc^Lgt tcctattccg aagttcctat tctctagaaa gtataggaac ttcgaattcc 60 



<210> 8 

<211> 61 
<212> Wk 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
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oligonucleotide 



^S'tg^t tcgaagttcc tatactttct agagaatagg aacttcggaa taggaacttc 60 
g 



<210> 9 
<211> 54 
<212> Efi5A 

<213> Artificial Secauence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

^?SgLgt tcctattccg aagttcctat tcttcaaata gtataggaac ttca 54 

<210> 10 
<211> 54 
<212> ENA 

<213> Artificial Sequence 

<220> . , ^ 

<223> Description of Artificial Sequence: 
oligonucleotide 

^Stg^ tcctatacta tttgaagaat aggaacttcg gaataggaac ttcg 54 

<210> 11 
<211> 54 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

^2c^^ tcctattccg aagttcctat tcttcaaaag gtataggaac ttca 54 

<210> 12 
<211> 54 
<212> ENA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

^[SgSgt tcctatacct tttgaagaat aggaacttcg gaataggaac ttcg 54 



<210> 13 
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<211> 54 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Descripticxi of Artificial Sequence: 
oligonucleotide 

giJSgSgt tcctattccg aagttcctat tctctagaaa gtataggaac ttca 54 

<210> 14 
<211> 54 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

^tctg^ tcctatactt tctagagaat aggaacttcg gaataggaac ttcg 54 

<210> 15 
<211> 20 
<212> ENA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequ^ice: 
oligonucleotide 

<400> 15 20 
acacggcggc atcagagcag 

<210> 16 
<211> 25 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 16 25 
taacagatct ccccctggcg aaagg 

<210> 17 
<211> 50 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 
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2^tcct attccgaagt tcctattctc tagaaagtat aggaacttcg 50 

<210> 18 
<211> 58 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oliganucleotide 

iltt^gt tcctatactt tctagagaat aggaacttcg gaataggaac ttcggtac 58 

<210> 19 
<211> 54 
<212> ENA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

tJ^^Sgt tcctattcog aagttcctat tcttcaaaag gtataggaac ttcg 54 

<210> 20 
<211> 54 
<212> nsiA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 20 

aattcgaagt tcctatacct tttgaagaat aggaacttcg gaataggaac ttcg 

<210> 21 
<211> 54 
<212> im 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 21 ^ 
tcgacgaagt tcctattccg aagttcctat tcttcaaata gtataggaac ttca o« 



<210> 22 
<211> 54 
<212> ENA 
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<213> Artificial Sequence 

<220> , , ^ 

<223> Description of Artificial Sequence: 
oligonucleotide 

ScttgSgt tcctatacta tttgaagaat aggaacttcg gaataggaac ttog 54 

<210> 23 
<211> 58 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligomcleotide 

^S^ttc gaagttccta ttccgaagtt cctattcttc aaaaggtata ggaacttc 58 

<210> 24 
<211> 50 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 24 . . ^ cA 

cggaagttcc tatacctttt gaagaatagg aacttcggaa taggaacttc 

<210> 25 
<211> 60 
<212> Wk 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequoice: 
oligcxiucleotide 

tJ^a^^t cgaagttcct attccgaagt tcctattctc tagaaagtat aggaacttcg 60 

<210> 26 
<211> 60 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 



<400> 26 
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tcgacgaagt tcctatactt tctagagaat aggaacttcg gaataggaac ttcgacgtcg 60 

<210> 27 
<211> 22 
<212> CNA 

<213> Artificial Sequence 
<220> 

<223> Descriptian of Artificial Sequence: 
oligonucleotide 

<400> 27 22 
catggacctg aaacaaaaca at 

<210> 28 
<211> 20 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequaice: 
oligonucleotide 

<400> 28 20 
ogattgtttt gtttcaggtc 

<210> 29 
<211> 10 
<212> rWA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequeice: 
oligonucleotide 

<400> 29 10 
gtacagatct 

<210> 30 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

l^tgc^cc gcgaagttcc tattccgaag ttcctattct ctagatagta taggaacttc 60 
gaattc 

<210> 31 
<211> 66 
<212> ENA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificieil Sequence: 
oligonucleoticle 

<400> 31 ^. 
aattgaattc gaagttccta tactatctag agaataggaa cttcggaata ggaacttcgc 60 

ggcogc 

<210> 32 
<211> 57 
<212> ENA 

<213> Artificial Sequence ^ 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 32 ^„ 
gtaccgaagt tcctattccg aagttcctat tcnnnnnnnn gtataggaac ttcggcc 

<210> 33 
<211> 49 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 33 

gaagttccta tacnrmnnnn ngaataggaa cttcggaata ggaacttcg 

<210> 34 
<211> 50 

<212> rm 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 34 

gaagttccta ttccgaagtt cctattcnnn nnnnngtata ggaacttcgc 3U 

<210> 35 
<211> 58 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 



<400> 35 
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tcgacttcaa ggataaggct tcaaggataa gnnnnnnnnc atatccttga agcgccgg 58 

<210> 36 
<211> 58 
<212> ENA 

<213> Artificial Sequoice 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

tSagai^t cctattccga agttcctatt ctctagaaag tataggaact tcacgcgt 58 

<210> 37 
<211> 58 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

tJS2^4Igt gaagttccta tactttctag agaataggaa cttcggaata ggaacttc 58 

<210> 38 
<211> 59 

<2i2> im 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

cSggaigtt cctattccga agttcctatt ctctagaaag tataggaact tcgagctct 59 

<210> 39 
<211> 59 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotidfi 

Sggagagct cgaagttcct atactttcta gagaatagga acttcggaat aggaacttc 59 
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